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STREPTOCOCCUS PNEUMONIAE PROTEINS AND NUCLHC ACID MOLECULES 

The present invention relates to proteins derived from Streptococcus pneumoniae, 
nucleic acid molecules encoding such proteins, the use of the nucleic acid and/or 
proteins as antigens/immunogens and in detection/diagnosis, as well as methods for 
5 screening the proteins/nucleic acid sequences as potential anti-microbial targets. 

Streptococcus pneumoniae, commonly referred to as the pneumococcus, is an 
important pathogenic organism. The continuing significance of Streptoccocus 
pneumoniae infections in relation to human disease in developing and developed 

10 countries has been authoritatively reviewed (Fiber, G.R., Science, 265: 1385-1387 
(1994)). That indicates that on a global scale this organism is believed to be the 
most common bacterial cause of acute respiratory infections, and is estimated to 
result in 1 million childhood deaths each year, mostly in developing countries 
(Stansfield, S.K., Pediatr, Infect. Dis., 6: 622 (1987)). In the USA it has been 

15 suggested (Breiman et al. Arch. Intern. Med,, ISO: 1401 (1990)) that the 
pneumococcus is still the most common cause of bacterial pneumonia, and that 
disease rates are particularly high in young children, in the elderly, and in patients 
with j predisposing conditions such as asplenia, heart, lung and kidney disease, 
diabetes, alcoholism, or with immunosupressive disorders, especially AIDS. These 

20 groups are at higher risk of pneumococcal septicaemia and hence meningitis and 
therefore have a greater risk of dying from pneumococcal infection. The 
pneumococcus is also the leading cause of otitis media and sinusitis, which remain 
prevalent infections in children in developed countries, and which incur substantial 
costs. 

25 

The need for effective preventative strategies against pneumococcal infection is 
highlighted by the recent emergence of penicillin-resistant pneumococci. It has been 
reported that 6.6% of pneumoccal isolates in 13 US hospitals in 12 states were found 



to be resistant to penicillin and some isolates were also resistant to other antibiotics 
including third generation cyclosporins (Schappert, S.M., Vital and Health Statistics 
of the Centres for Disease Control/National Centre for Health Statistics, 214:1 
(1992)). The rates of penicillin resistance can be higher (up to 20%) in some 
hospitals (Breiman et aU J. Am. Med. Assoc., 271: 1831 (1994)). Since the 
development of penicillin resistance among pneumococci is both recent and sudden, 
coming after decades during which penicillin remained an effective treatment, these 
findings are regarded as alarming. 

For the reasons given above, there are therefore compelling grounds for considering 
improvements in the means of preventing, controlling, diagnosing or treating 
pneumococcal diseases. 

Various approaches have been taken in order to provide vaccines for the prevention 
of pneumococcal infections. Difficulties arise for instance in view of the variety of 
serotypes (at least 90) based on the strucmre of the polysaccharide capsule 
surrounding the organism. Vaccines against individual serotypes are not effective 
against other serotypes and this means tiiat vaccines must include polysaccharide 
antigens from a whole range of serotypes in order to be effective in a majority of 
cases. An additional problem arises because it ahs been found that the capsular 
polysaccharides (each of which determines the serotype and is the major protective 
antigen) when purified and used as a vaccine do not reliably induce protective 
antibody responses in children under two years of age, the age group which suffers 
the highest incidence of invasive pneumococcal infection and meningitis. 

A modification of the approach using capsule antigens relies on conjugating the 
polysaccharide to a protein in order to derive an enhanced immune response, 
particularly by giving the response T-cell dependent character. This approach has 
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been used in the development of a vaccine against Haemophilus influenzae. There 
are issues of cost concerning both the multi-polysaccharide vaccines and those based 
on conjugates. 

5 

A third approach is to look for other antigenic components which offer the potential 
to be vaccine candidates. In the present application we provide a group of proteins 
antigens which are secreted/exported proteins. 

10 Thus, in a first aspect the present invention provides a Streptococcus pneumoniae 
protein or polypeptide having a sequence selected from those shown in table 2 
herein. , 

A protein or polypeptide of the present invention may be provided in substantially pure 
15 form. For example, it may be provided in a form which is substantially free of other 
proteins. 

In a preferred embodiment, a protein or polypeptide having an amino acid sequence as 
' shown in Table 3 is provided. 

20 

The invention encompasses any protem coded for by a nucleic acid sequence as shown 
in Table 1 herein. 

As discussed herein, the proteins and polypeptides of the invention are useful as 
25 antigenic material. Such material can be "antigenic" and/or "immunogenic". 
Generally, "antigenic" is taken to mean that the protein or polypeptide is capable of 
being used to raise antibodies or indeed is capable of inducing an antibody response in 
a subject. " Immunogenic" is taken to mean that the protein or polypeptide is capable of 
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eiiciting a protective immune response in a subject. Thus, in tiie latter case, the protein 
or polypeptide may be capable of not only generating an antibody response and in 
addition non-antibody based immune responses, 

5 



The skilled person will appreciate that homologues or derivatives of the proteins or 
10 polypeptides of the invention will also find use in the context of the present invention, 
ie as antigenic/immunogenic material. Thus, for instance proteins or polypeptides 
which ^include one or more additions, deletions, substitutions or the like are 
encompassed by the present invention. In addition, it may be possible to replace one 
amino acid with another of similar "type". For instance replacing one hydrophobic 
15 amino acid with another. One can use a program such as the CLUSTAL program to 
compare amino acid sequences. This program compares amino acid sequences and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. It is 
possible to calculate amino acid identity or similarity (identity plus conservation of 
amino acid type) for an optimal alignment. A program like BLASTx will align the 
20 longest stretch of similar sequences and assign a value to the fit. It is thus possible to 
obtain a comparison where several regions of similarity are foimd, each having a 
different score. Both types of analysis are contemplated in the present invention. 

In the case of homologues and derivatives, the degree of identity with a protein or 
25 polypeptide as described herein is less important than that the homologue or derivative 
should retain its antigenicity or immunogenicity to streptoccocus pneumoniae. 
However, suitably, homologues or derivatives having at least 60% similarity (as 
discussed above) with the proteins or polypeptides described herein are provided. 
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Preferably, homologues or derivatives having at least 70% similarity, more preferably 
at least 80% similarity are provided. Most preferably, homologues or derivatives 
having at least 90% or even 95% similarity are provided. 

5 In an alternative approach, the homologues or derivatives could be fusion proteins, 
incorporating moieties which render purification easier, for example by effectively 
tagging the desired protein or polypeptide. It may be necessary to remove the "tag** or 
it may be the case that the fusion protein itself retains sufficient antigenicity to be 
useful. 

10 

In an additional aspect of the invention there are provided antigenic fragments of the 
proteins or polypeptides of the invention, or of homologues or derivatives thereof. 

For fragments of the proteins or polypeptides described herein, or of homologues or 
15 derivatives thereof, the simation is slightiy different. It is well known that is possible to 
screen an antigenic protein or polypeptide to identify epitopic regions, ie those regions 
which are responsible for the protein of polypeptide's antigenicity or immunogenicity. 
Methods for carrying out such screening are well known in the art. Thus, the fragments 
of the present invention should include one or more such epitopic regions or be 
20 sufficiently similar to such regions to retain their antigenic/immunogenic properties. 
Thus, for fragments according to the present invention the degree of identity is perhaps 
irrelevant, since they may be 100% identical to a particular part of a protein or 
polypeptide, homologue or derivative as described herein. The key issue, once again, is 
that the fragment retains the antigenic/immunogenic properties. 

25 

Thus, what is important for homologues, derivatives and fragments is that they possess 
at least a degree of the antigenicity/immunogenicity of the protein or polypeptide from 
which they are derived. 



Gene cloning techniques may be used to provide a protein of the invention in - 
substantially pure form. These techniques are disclosed, for example, in J- Sambrook 
ei al Molecular Cloning 2nd Edition, Cold Spring Harbor Laboratory Press (1989). 
Thus, in a fourth aspect, the present invention provides a nucleic acid molecule 
comprising or consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 1 or their RNA equivalents; 

(ii) a sequence which is complementary to any of the sequences of (i); 

,(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a 
protein as defined in Table 1 . 

In a fifth aspect the present invention provides a nucleic acid molecule comprising or 
consisting of a sequence which is: 

(i) any of the DNA sequences set out in Table 4 or their RNA equivalents; 



(ii) a sequence which is complementary to any of the sequences of (i); 




7 



(iii) a sequence which codes for the same protein or polypeptide, as those 
sequences of (i) or (ii); 

(iv) a sequence which is has substantial identity with any of those of (i), (ii) 
5 and (iii); 

(v) a sequence which codes for a homologue, derivative or fragment of a protein 
as defined in Table 4. 

10 The nucleic acid molecules of the invention may include a plurality of such sequences, 
and/or fragments. The skilled person will appreciate that the present invention can 
include* novel variants of those particular novel nucleic acid molecules which are 
exemplified herein.. . Such variants are encompassed by the present invention. These 
may occur in nature, for example because of strain variation. For example, additions, 

15 substitutions and/or deletions are included. In addition, and particularly when utilising 
microbial expression systems, "one may wish to engineer the nucleic acid sequence by 
making use of known preferred codon usage in the particular organism being used for 
expression. Thus, synthetic or non-namrally occurring variants are also included within 
the scope of the invention. 

20 

The term "RNA equivalent" when used above indicates that a given RNA molecule has 
a sequence which is complementary to that of a given DNA molecule (allowing for the 
fact that in RNA "U" replaces "T" in the genetic code). 

25 When comparing nucleic acid sequences for the purposes of determining the degree of 
homology or identity one can use programs such as BESTFIT and GAP (both from the 
Wisconsin Genetics Computer Group (GCG) software package) BESTFIT, for 
example, compares two sequences and produces an optimal alignment of the most 
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similar segments. GAP enables sequences to be aligned along their whole length and 
finds the optimal alignment by inserting spaces in either sequence as appropriate. 
Suitably, in the context of the present invention compare when discussing identity of 
nucleic acid sequences, the comparison is made by alignment of the sequences along 
5 their whole length. 

Preferably, sequences which have substantial identity have at least 50% sequence 
identity, desirably at least 75% sequence identity and more desirably at least 90 or at 
least 95% sequence identity with said sequences. In some cases the sequence identity 
10 may be 99% or above. 

Desirably, the term "substantial identity" indicates that said sequence has a greater 
degree of identity with any of the sequences described herein than with prior art nucleic 
acid sequences. 

15 

It should however be noted that where a nucleic acid sequence of the present invention 
codes for at least part of a novel gene product the present invention includes within its 
scope all possible sequence coding for the gene product or for a novel part thereof. 

20 The nucleic acid molecule may be in isolated or recombinant form. It may be 
incorporated into a vector and the vector may be incorporated into a host. Such vectors 
and suitable hosts form yet further aspects of the present invention. 

Therefore, for example, by using probes based upon the nucleic acid sequences 
25 provided herein, genes in Streptococcus pneumoniae can be identified. They can then 
be excised using restriction enzymes and cloned into a vector. The vector can be 
introduced into a suitable host for expression. 
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Nucleic acid molecules of the present invention may be obtained from S.pneumoniae by 
the use of appropriate probes complementary to pan of the sequences of the nucleic 
acid molecules. Restriction enzymes or sonication techniques can be used to obtain 
appropriately sized fragments for probing. 

5 

Alternatively PGR techniques may be used to amplify a desired nucleic acid sequence. 
Thus the sequence data provided herein can be used to design two primers for use in 
PGR so thai a desired sequence, including whole genes or fragments thereof, can be 
targeted and then amplified to a high degree. One primer will normally show a high 
10 degree of specificity for a first sequence located on one strand of a DNA molecule, and 
the other primer will normally show a high degree of specificity for a second sequence 
located pn the complementary strand of the DNA sequence and being spaced from the 
complementary sequence to the first sequence. 

15 Typically primers will be at least 15-25 nucleotides long. 

As a further alternative chemical synthesis may be used. This may be automated. 
Relatively short sequences may be chemically synthesised and ligated together to 
provide a longer sequence. 

20 

In yet a further aspect the present invention provides an immunogenic/antigenic 
composition comprisuag one or more proteins or polypeptides selected from those 
whose sequences are shown in Tables 2-4, or homologues or derivatives thereof, 
and/or fragments of any of these. In preferred embodiments, the 
25 immunogenic/ antigenic composition is a vaccine or is for use in a diagnostic assay. 

In the case of vaccines suitable additional excipients. diluents, adjuvants or the like 
may be included. Numerous examples of these are well known in the art. 
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It is also possible to utilise the nucleic acid sequences shown in Table 1 in the 
preparation of so-called DNA vaccines. Thus, the invention also provides a vaccine 
composition comprising one or more nucleic acid sequences as defined herein. The - 
5 use of such DNA vaccines is described in the art. See for instance, Donnelly et al , 
Ann, Rev, Immunol., 15:617-648 (1997). 

• As already discussed herein the proteins or polypeptides described herein, their 

1 

\ homologues or derivatives, and/or fragments of any of these, can be used in methods 

10 of detecting/diagnosing S.pneumoniae. Such methods can be based on the detection 
of antibodies against such proteins which may be present in a subject. Therefore the 
: 5; presen; invention provides a method for the detection/diagnosis of S.pneumoniae 

Jjv which comprises the step of bringing into contact a sample to be tested with at least 

□ one protein, or homologue, derivative or fragment thereof, as described herein, 

rg 15 Suitably, the sample is a biological sample, such as a tissue sample or a sample of 

blood or saliva obtained from a subject to be tested. 

m In an alternative approach, the proteins described herein, or homologues, derivatives 

i;!.: and/or fragments thereof, can be used to raise antibodies, which in turn can be used 

M'^ 20 to detect the antigens, and hence S, pneumoniae. Such antibodies form another aspect 

of the invention. Antibodies within the scope of the present invention may be 

monoclonal or polyclonal. 



Polyclonal antibodies can be raised by stimulating their production in a suitable animal 
25 host (e.g. a mouse, rat, guinea pig, rabbit, sheep, goat or monkey) when a protein as 
described herein, or a homologue, derivative or fragment thereof, is injected into the 
animal. If desired, an adjuvant may be administered together with the protein. Well* 
known adjuvants include Freund's adjuvant (complete and incomplete) and aluminium 
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hydroxide. The antibodies can then be purified by virtue of their binding to a protein as 
described herein. 

Monoclonal antibodies can be produced from hybridomas. These can be formed by 
5 fusing myeloma cells and spleen cells which produce the desired antibody in order to 
form an immortal cell line. Thus the well-known Kohler & Milstein technique {Nature 
256 (1975)) or subsequent variations upon this technique can be used. 

Techniques for producing monoclonal and polyclonal antibodies that bind to a 
10 particular polypeptide/protein are now well developed in the art. They are discussed in 
standard immunology textbooks, for example in Roitt et aU Immunology second edition 
(1989),, Churchill Livingstone, London. 

In addition to whole antibodies, the present invention includes derivatives thereof which 
15 are capable of binding to proteins etc as described herein. Thus the present invention 
includes antibody fragments and synthetic constructs. Examples of antibody fragments 
and synthetic constructs are given by Dougall et al in Tibtech 12 372-379 (September 
1994). 

20 Antibodv fragments include, for example. Fab, F(ab')2 and Fv fragments. Fab 
fragments (These are discussed in Roitt et al [supra] ). Fv fragments can be modified 
to produce a synthetic construct known as a single chain Fv (scFv) molecule. This 
includes a peptide linker covalently joining and Vj regions, which contributes to the 
stability of the molecule. Other synthetic constructs that can be used include CDR 

25 peptides. These are synthetic peptides comprising antigen-binding determinants. 
Peptide mimetics may also be used. These molecules are usually conformationally 
restricted organic rings that mimic the structure of a CDR loop and that include 
antigen-interactive side chains. 
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Synthetic constructs include chimaeric molecules. Thus, for example, humanised (or 
primatised) antibodies or derivatives thereof are within the scope of the present 
invention. An example of a humanised antibody is an antibody having human 
framework regions, but rodent hypervariable regions. Ways of producing chimaeric 
antibodies are discussed for example by Morrison etal 'm PNAS, 81, 6851-6855 (1984) 
and by Takeda et al in Nature, 314, 452-454 (1985), 

Synthetic constructs also include molecules comprising an additional moiety that 
provides the molecule with some desirable property in addition to antigen binding. For 
example the moiety may be a label (e.g. a fluorescent or radioactive label). 
Alternatively, it may be a pharmaceutically active agent. 

Antibodies, or derivatives thereof, find use in detection/diagnosis of S.pneumoniae. 
Thus, in another aspect the present invention provides a method for the 
detection/diagnosis of pneumoniae which comprises the step of bringing into contact 
a sample to be tested and antibodies capable of binding to one or more proteins 
described herein, or to homologues, derivatives and/or fragments thereof. 

In addition, so-called "Affibodies" may be utilised. These are binding proteins 
selected from combinatorial libraries of an alpha-helical bacterial receptor domain 
(Nord et al ,) Thus, Small protein domains, capable of specific binding to different 
target proteins can be selected using combinatorial approaches. 

It will also be clear that the nucleic acid sequences described herein may be used to 
detect/diagnose S.pneumoniae, Thus, in yet a further aspect, the present invention 
provides a method for the detection/diagnosis of S.pneumoniae which comprises the 
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step of bringing into contact a sample to be tested with at least one nucleic acid 
sequence as described herein. Suitably, the sample is a biological sample, such as a 
tissue sample or a sample of blood or saliva obtained from a subject to be tested. 
Such samples may be pre-treated before being used in the methods of the invention. 
5 Trhus, for example, a sample may be treated to extract DNA. Then, DNA probes 
based on the nucleic acid sequences described herein (ie usually fragments of such 
sequences) may be used to detect nucleic acid from S. pneumoniae. 

In additional aspects, . the present invention provides: 

10 

(a) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a protein or polypeptide of the invention, or a 
derivative, homologue or fragment thereof, or an immunogenic composition of the 
invention; 

15 

(b) a method of vaccinating a subject against S.pneumoniae which comprises the 
step of administering to a subject a nucleic acid molecule as defined herein; 

(c) a method for the prophylaxis or treatment of S.pneumoniae infection which 
20 comprises the step of administering to a subject a protein or polypeptide of the 

invention, or a derivative, homologue or fragment thereof, or an immunogenic 
composition of the invention; 

(d) a method for the prophylaxis or treatment of S.pneumoniae infection which 
25 comprises the step of administering to a subject a nucleic acid molecule as defined 

herein; 

(e) a kit for use in detecting/diagnosing S.pneumoniae infection comprising one 
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or more proteins or polypeptides of the invention, or homologues, derivatives or 
fragments thereof, or an antigenic composition of the invention; and 

(f) a kit for use in detecting/diagnosing S, pneumoniae infection comprising one 
or more nucleic acid molecules as defined herein. 

Given that we have identified a group of important proteins, such proteins are 
potential targets for anti-microbial therapy. It is necessary, however, to determine 
whether each individual protein is essential for the organism's viability. Thus, the 
present invention also provides a method of determining whether a protein or 
polypeptide as described herein represents a potential anti-microbial target which 
comprises inactivating said protein and determining whether S.pneumoniae is still 
viable, in vitro or in vivo. 

A suitable method for inactivating the protein is to effect selected gene knockouts, ie 
prevent expression of the protein and determine whether this results in a lethal 
change. Suitable methods for carrying out such gene knockouts are described in Li et 
P.M^.S., 94:13251-13256 (1997). 

In a final aspect the present invention provides the use of an agent capable of 
antagonising, inhibiting or otherwise interfering with the function or expression of a 
protein or polypeptide of the invention in the manufacture of a medicament for use in 
the treatment or prophylaxis of S.pneumoniae infection. 

The invention will now be described with reference to the following examples, 
which should not be construed as in any way limiting the invention. The examples 
refer to the figures in which: 
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Figure 1: shows the results of various DNA vaccine trials; and 
Figure 2: shows the results of further DNA vaccine trials. 

EXAMPLE 1 

5 

The Genome sequencing of Streptococcus pneumoniae type 4 is in progress at the 

Institute for Genomic Research (TIGR, Rockville, MD, USA), Up to now, the 
whole sequence has not been completed or published. On 21'^ November 1997, the 

10 TIGR centre released some DNA sequences as contigs which are not accurate 
reflections of the finished sequence. These contigs can be downloaded from their 
Webster (www@tigr.org) . We downloaded these contigs and created a local database 
using the application GCGToBLAST (Wisconsin Package Version 9.1, Genetics 
Computer Group (GCG), Madison, USA). This database can be searched with the 

15 Fast A and TfastA procedures (using the method of Pearson and Lipman {PNAS USA, 
85:2444-2448 (1988)). 

Using FastA and TfastA procedures, the local pneumococcus database was searched 
' for putative leader sequence or anchor sequence features. Relevant sequences were 
20 used to interrogate for comparative novel sequences. These were: 

(i) already described leader sequences of Streptococcus pneumoniae (from 
proteins NanA, NanB, LytA, Pap A, pep A. PsaA and PspA); 

25 (ii) the leader sequence of Usp45, a secreted protein from Laaococcus lactis; 

(iii) new hypothetical leader sequences derived from the searches in (i) and 

(ii) ; 
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(iv) the anchor motif LPxTG. a feature common to many Gram-positive 
bacteria surface proteins which are anchored by a mechanism involving the 
Sortase complex proteins. 

5 

Provided below is an example of this approach, with reference to the sequences 
derived from the database (see table 1). 
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15 



20 



Usp45 



NanA 



PapA PsaA 



PspA 



PrtP 





4121.1 4101.3 4172.4 4106.1 4346.1 4194.1 4172.2 4147.2 



10 4191.6 



4137.3 4193.1 4122.1 



4170.8 4175.3 



4187.2 4188.1 4102.1 



4136.1 4127.1 4191.1 



1 



4119.1 



4119.2 
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The protein leader sequences of different known exported proteins were used as a 
starling point for a search of the local pneumococcus database described above. The 
hypothetical proteins found with this search were then submitted to a Blast search in 
general databases such as EMBL, Swissprot etc. Proteins remaining unknown in the 
pneumococcus are kept and annotated. Then the search is performed again using the 
new potential protein leader sequence as a probe, using the TfastA procedure. 



18 



Example 2: DNA vaccine trials 
pcDNA3.l4- as a DNA vaccine vector 
5 pcDNA3.1 + 

The vector chosen for use as a DNA vaccine vector was pcDNAS.l (Invitrogen) 
(actually pcDNAS.l + , the forward orientation was used in all cases but may be 
referred to as pcDNAS.l here on). This vector has been widely and successfully 

10 employed as a host vector to test vaccine candidate genes to give protection against 
pathogens in the literature (Zhang, et uL, Kurar and Splitter, Anderson et al). The 
vector was designed for high-level stable and non-replicative transient expression in 
mammalian cells. pcDNAS.l contains the ColEl origin of replication which allows 
convenient high-copy number replication and growth in £. coli. This in turn allows 

15 rapid and efficient cloning and testing of many genes. The pcDNAS.l vector has a 
large number of cloning sites and also contains the gene encoding ampicillin 
resistance to aid in cloning selection and the human cytomegalovirus (CMV) 
inunedi^te-early promoter/enhancer which permits efficient, high-level expression -of 
the recombinant protein. The CMV promoter is a strong viral promoter in a wide 

20 range of cell types including both muscle and immune (antigen presenting) cells. 

This is important for optimal immune response as it remains unknown as to which 
cells types are most important in generatmg a protective response in vivo, A T7 
promoter upstream of the multiple cloning site affords efficient expression of the 
modified insert of interest and which allows in vitro transcription of a cloned gene in 

25 the sense orientation. 

Zhang, D., Yang, X., Berry, J. Shen, C, McClarty, G. and Brunham, R.C, (1997) 
- "DNA vaccination with the major outer-membrane protein genes induces acquired 
immunity to Chlamydia trachomatis (mouse pneumonitis) infection". Infection and 
30 Immunity, 176, 10S5-40. 

Kurar, E. and Splitter, G.A. (1997) "Nucleic acid vaccination of Brucella abortus 
ribosomal L7/L12 gene elicits immune response". Vaccine, 15, 1851-57. 

35 Anderson, R., Gao, X.-M., Papakonstantinopoulou, A., Roberts, M. and Dougan, 
G. (1996) "Immune response in mice following immunisation with DNA encoding 
fragment C of tetanus toxin". Infection and Immunity, 64, 3168-S17S. 

Preparation of DNA vaccines 

40 

Oligonucleotide primers were designed for each individual gene of interest derived 
using the LEEP system. Each gene was examined thoroughly, and where possible, 
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primers were designed such that they targeted that portion of the gene thought to 
encode only the mature portion of the gene protein. It was hoped that expressing 
those sequences that encode only the mature ponion of a target gene protein, would 
facilitate its correct folding when expressed in mammalian cells. For example, in the 
5 majority of cases primers were designed such that putative N-terminal signal peptide 
sequences would not be included in the final amplification product to be cloned into 
the pcDNAS.l expression vector. The signal peptide directs the polypeptide 
precursor to the cell membrane via the protein export pathway where it is normally 
cleaved off by signal peptidase I (or signal peptidase II if a lipoprotein). Hence the 
10 signal peptide does not make up any part of the mamre protein whether it be 

displayed on the surface of the bacteria surface or secreted. Where a N-terminal 
leader peptide sequence was not immediately obvious, primers were designed to 
target the whole of the gene sequence for cloning.and ultimately, expression in 
pcDNAS.l. 

15 

Having said that, however, other additional feamres of proteins may also affect the 
expression and presentation of a soluble protein. DNA sequences encoding such 
features in the genes encoding the proteins of interest were excluded during the 
20 design of oligonucleotides. These features included: 

1. LPXTG cell wall anchoring motifs. 

2. LXXC ipoprotein attachment sites. 

3. Hydrophobic C-terminal domain. 

25 4. Where no N-terminal signal peptide or LXXC was present the start codon was 
excluded. 

5. Where no hydrophobic C-terminal domain or LPXTG motif was present the stop 
codon was removed. 

30 Appropriate PCR primers were designed for each gene of interest and any and all of 
the regions encoding the above features was removed from the gene when designing 
these primers. The primers were designed with the appropriate enzyme restriction 
site followed by a conserved Kozak nucleotide sequence (in all cases) GCCACC was 
used. The Kozak sequence facilitates the recognition of initiator sequences by 

35 eukaryotic ribosomes) and an ATG start codon upstream of the insert of the gene of 
interest. For example the forward primer using a BamHl site the primer would 
begin GCGGGATCCGCCACCATG followed by a small section of the 5* end of the 
gene of interest. The reverse primer was designed to be compatible with the forward 
primer and with a Notl restriction site at the 5' end in all cases (this site is 

40 TTGCGGCCGC). 

PCR primers 
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The following PGR primers were designed and used to amplify the truncated genes 
of interest. 



ID210 

Forward Primer 5" CGGATCCGCCACCATGTCTTCTAATGAATCTGCCGATG 
3" 

Reverse Primer 5 ' TTGCGGCCGCCTGTTTAGATTGGATATCTGTAAAGACTT 

3' 

4172.5 

Forward Primer 5' 

CGCGGATCCGCCACCATGGATTTTCCTTCAAATTTGGAGG 3 ' 
Reverse Primer 5" TTGCGGCCGCACCGTACTGGCTGCTGACT 3* 

ID211 

Forward Primer 5' 

CGGATCCGCC ACC ATG AGTGAG ATCAAAATTATTAACGC 3 ' 
Reverse Primer 5" TTGCGGCCGCCGTTCCATGGTTGACTCCT 3" 

4197.4 

Forward Primer 5' CGCGGATCCGCCACCATGTGGGACATATTGGTGGAAAC 
3' 

Reverse Primer 5' TTGCGGCCGCTTCACTTGAGCAAACTGAATCC 3' 
4122.1 

Forward Primer 5' 

CGCGGATCCGCCACC ATGTCACAAGAAAAAACAAAAAATGAA 3 ' 
Reverse Primer 5" TTGCGGCCGCATCGACGTAGTCTCCGCC 3' 

4126.7 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTGGTTGGAACTTTCTACTATCAAT 3' , 
Reverse Primer 5' TTGCGGCCGCAACTTTCGTCCCTTTTTGG 3' 
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4188.11 

Forward Primer 5' CGCGGATCCGCCACCATGGGCAATTCTGGCGGAA 3' 
Reverse Primer 5' TTGCGGCCGCTTGTTTCATAGCTTTTTTGATTGTT 3' 

ID209 

Forward Primer 5' 

CGCGGATCCGCCACCATGCTATTGATACGAA ATGCAGGG 3 ' 

Reverse Primer 5' TTGCGGCCGCAACATAATCTAGTAAATAAGCGTAGCC 3' 

ID215 

Forward Primer 5' CGCGGATCCGCCACCATGACGGCGACGAATTTTC 3' 
Reverse Primer 5' TTGCGGCCGCTTAATTCGTTTTTGAACTAGTTGCT 3' 

4170.4 

Forward Primer 5' 

CGCGG ATCCGCCACC ATGGCTGTTTTTCTTCGCT ATC ATG 3 ' 
Reverse Primer 5' TTGCGGCCGCTTTCTTCAACAAACCTTGTTCTTG 3* 

4193.1 

Forward Primer 5' 

CGCGGATCCGCCACCATGGGTAACCGCTCTTCTCGTAAC 3* 
Reverse Primer 5' TTGCGGCCGCGCTTCCATCAAGGATTTTAGC 3' 

Cloning 

The insert along with the flanking features described above was amplified using PGR 
against a template of genomic DNA isolated from type 4 5. pneumoniae strain 11886 
obtained from the National Collection of Type Cultures. The PGR product was cut 
with the appropriate restriction enzymes and cloned in to the multiple cloning site of 
pcDNA3,l using conventional molecular biological techniques. Suitably mapped 
clones of the genes of interested were culmred and the plasmids isolated on a large 
scale (> 1.5 mg) using Plasmid Mega Kits (Qiagen). Successful cloning and 
maintenance of genes was confirmed by restriction mapping and sequencing — 700 
base pairs through the 5' cloning junction of each large scale preparation of each 
construct. 
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A strain of type 4 was used in cloning and challenge methods which is the strain 
from which the 5. pneumoniae genome was sequenced. A freeze dried ampoule of a 
homogeneous laboratory strain of type 4 S. pneumoniae strain NCTC 11886 was 

5 obtained from the National Collection of Type Strains, The ampoule was opened and 
the cultured re suspended with 0.5 nal of tryptic soy broth (0.5% glucose, 5% 
blood). The suspension was subcultured into 10 ml tryptic soy broth (0.5% glucose, 
5% blood) and incubated statically overnight at 37°C. This culture was streaked on 
to 5 % blood agar plates to check for contaminants and confirm viability and on to 

10 blood agar slopes and the rest of the culture was used to make 20% glycerol stocks. 
The slopes were sent to the Public Health Laboratory Service where the type 4 
serotype was confirmed. 

A glycerol stock of NCTC i 1886 was streaked on "a 5% blood agar plate and 
15 incubated overnight in a C02 gas jar at 37°C. Fresh streaks were made and optochin 
sensitivity was confirmed. 

Pneumococcal challenge 

20 A standard inoculum of type 4 S. pneumoniae was prepared and frozen down by 

passaging a culture of pneumococcus Ix through mice, harvesting from the blood of 
infected animals, and grown up to a predetermined viable count of around 10^ 
cfu/ml in broth before freezing down. The preparation is set out below as per the 
flow chart. 

25 

Streak pneumococcal culture and confirm identity 



V 

30 

Grow over-night culture from 4-5 colonies on plate above 



35 

Animal passage pneumococcal culture 
(i.p. injection of cardiac bleed to harvest) 

! 

40 V 



Grow over-night culture from animal passaged pneumococcus 
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I 

V 

5 Grow day culture (to pre-determined optical density) from over-night of animal 

passage and freeze down at -70'*C - This is standard minimum 



10 

Thaw one aliquot of standard inoculum to viable count 



15 . - 

Use standard inoculum to determine effective dose (called Virulence Testing) 

1 

V 

20 

All subsequent challenges - use standard inoculum to effective dose 

An aliquot of standard inoculum was diluted 500x in PBS and used to inoculate the 
mice. 

25 

Mice were lightly anaesthetised using halothane and then a dose of 1.4 x 10^ cfu of 
pneumococcus was applied to the nose of each mouse. The uptake was facilitated by 
the normal breathing of the mouse, which was left to recover on its back. 

30 5. pneumoniae vaccine trials 

Vaccine trials in mice were carried out by the administration of DNA to 6 week old 
CBA/ca mice (Harlan, UK). Mice to be vaccinated were divided into groups of six 
and each group was inmiunised with recombinant pcDNA3.1 4- plasmid DNA 

35 containing a specific target-gene sequence of interest. A total of 100 ftg of DNA in 
Dulbecco's PBS (Sigma) was injected intramuscularly into the tibialis anterior 
muscle of both legs (50 in each leg). A boost was carried using the same 
procedure 4 weeks later. For comparison, control groups were included in all 
vaccine trials. These control groups were either unvaccinated animals or those 

40 administered with non-recombinant pcDNA3.1-h DNA (sham vaccinated) only, 

using the same time course described above. 3 weeks after the second immunisation, 
all mice groups were challenged intra-nasally with a lethal dose of S. pneumoniae 
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serotype 4 (strain NCTC 11886). The number of bacteria administered was 
monitored by plating serial dilutions of the inoculum on 5% blood agar plates. A 
problem with intranasal immunisations is that in some mice the inoculum bubbles out 
of the nostrils, this has been noted in results table and taken account of in 

5 calculations. A less obvious problem is that a certain amount of the inoculum for 
each mouse may be swallowed. It is assumed that this amount will be the same for 
each mouse and will average out over the course of innoculations. However, the 
sample sizes that have been used are small and this problem may have significant 
effects in some experiments. All mice remaining after the challenge were killed 3 or 

10 4 days after infection. During the infection process, challenged mice were monitored 
for the development of symptoms associated with the onset of S, pneumoniae 
induced-disease. Typical symptoms in an appropriate order included piloerection, 
an increasingly hunched posture, discharge from eyes, increased lethargy and 
reluctance to move. The latter symptoms usually "coincided with the development of 

15 a moribund state at which stage the mice were culled to prevent ftirther suffering. 

These mice were deemed to be very close to death, and the time of culling was used 
to determine a survival time for statistical analysis. Where mice were found dead, 
the survival time was taken as the last time point when the mouse was monitored 
alive. 

20 

Interpretation of Results 

A positive result was taken as any DNA sequence that was cloned and used in 
challenge experiments as described above which gave protection against that 

25 challenge. Protection was taken as those DNA sequences that gave statistically 

significant protection (to a 95% confidence level (p<0.05)) and also those which were 
marginal or close to significant using Mann-Whimey or which show some protective 
features for example there were one or more outlying mice or because the time to the 
first death was prolonged. It is acceptable to allow marginal or non-significant results 

30 to be considered as potential positives when it is considered that the clarity of some of 
the results may be clouded by the problems associated with the administration of 
intranasal infections. 
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Table 1 



4101.1 

ATGGAAGAGTTAGTGACCTTAGATTCrrTTGTTTATTGACAGAACTAAGATTGAAGCCAATGC^ 
5 TTGTGTGGAAGAAAACGACAGAGAAATTCTCCGCCAAACTTCAAGAACAGATACAGCTCTATTTTCA^ 

TCACTCCCCrrCTGATTAAATATGCCATGTTTGATAAGAAACAAAAGAGAGGGTATAAAGAGTC AGCT AAAAACT 
TAGCGAArrGGCACTATAATGACAAGGAGGATAGCTACACACATCCTGATGGCTGGTATTATCGTnTCACCATAC 
CAAATATCAGAAAACACAGACAGACTTTCAACAAGAAATCAAGGTTTACTACGCCGACGAACCrGAATCAGCCCC 
TCAAAAGGGACTGTATATGAACGAACGCTATCAAAACTTGAAAGCTAAAGAATGTCAGGCGCTT TTATCT 
10 AGGTAGACAGATTTTCGCTCAACGCAAGATTGATGTGGAACCTGTCTTTGGGCAGATAAAGGCl'lCl i IGGGTTAC 

AAGAGATGTAATCTGAGAGGGAAGCGTCAAGTGAGAATTGACATGGGATTGGTACTTATGGCCAATAACCrCCTA 
AAATATAGTAAAATGAAATAA 

4101,3 

15 atggggaaaggccattggaatcggaaaagagtttatagcattcgtaagtttgctgtgggaccttcctcagtaa 
attgggacttgtgcagttttattaggaggaaatatagctggagaatctgtagtitatgcggatgaaacact^ 
ctcatactgctgagaaacctaaagaggaaaaaatgatagtagaagaaaaggctgataaagctttgcaaact 
aatatagttgaaaggacagaacaaagtgaacrtagttcaactgagoctattgcatct 

GCCGTAACTCCAAAAGAGGAAj\AAGTGTCTGCTAAACCGGAAGAAAAAGCTCCAAGGATAGAAT^ 

20 AAATCAAGAA AAAC CGCTCAAGGAAGATGCTAAAGCTGTAACAAATGAAGAAGTGAATCAAATGATTGAAGACA 
GGAAAGTGGATTTTAATCAAAATTGGTACrrTAAACrCAATGCAAATTCTAAGGAAGC 
ACGTATCTACGTGGAAAAAATTAGATITACCGTATGACTGGAGTATCTTTAACGATrrCGATCATGAATCT^ 
ACAAAATGAAGGTGGACAGCTCAACGGTGGGGAAGCTTGGTATCGCAAGACTTTCAAACTAGATGAAAAAGACCT 
CAAG AAAAATGTTCGCCTTACT' i' 1 \ G ATGGCGTCTACATGGATTCTCAAGTTTATGTCAATGGTCAGTTAGTGGGC 

25 CATTATCCAAATGGTTATAACCAGTTCTCATATGATATCACCAAATACCrrrCAAAAAGATGGTCGTGAGAATGTG 
rrGCTGTCCATGCAGTCAACAAACAGCCAAGTAGCCGTTGGTATTCAGGAAGTGGTATCTATCGTGATGTGACTTT 
ACAAGTGACAGATAAGGTGCATGTTGAGAAAAATGGGACAACTATTTTAACACCAAAACTTGAAGAAC^ 
TGGCAAGGTTGAAACTCATGTGACCAGCAAAATCGTCAATACGGACGACAAAGACCATGAACTTGTAGCCGAATA 
TCAAATCGTTGAACGAGGTGGTCATGCTGTAACAGGCTTAGTTCGTACAGCGAGTCGTACCTTAAAAGCACATGA 

30 ATCAACAAGCCTAGATGCGATTTTAGAAGTTGAAAGACCAAAACTCTGGACTGTTTTAAATGAO^ 
TACGAATTGArrACGCGTGTTTACCGTGACGGTCAATTGGTTGATGCTAAGAAGGA' inU ' G ' iri 'GGT^ 
ATCACTGGACTCCAAATGAAGCTTTCTCTrrGAATGGTGAACGTATTAAATTCCATGGAGTATCCrTGCACC^ 
CCATGGGGCGCTTGGAGCAGAAGAAAACTATAAAGCAGAATATCGCCGTCTCAAACAAATGAAGGAGATGGGAG 
TTAACTCCATCC GTACA ACCCACAACCCTGCTAGTGAGCAAACCTTGCAAATCGCAGCAGAACTAGGTTTACTCOT 

35 TCAGGAAGAGGCCTrrGATACGTGGTATGGTGGCAAGAAACCTTATGACTATGGACGIll'Cri'l'GAAAAAGATGC 
CACTCACCCAGAAGCTCGAAAAGGTGAAAAATGGTCTGATTTTGACCTACGTACCATGGTCCAAAGACGCAAAAA 
CAACCCTGCTATCTTCATGTGGTCAATTGGTAATGAAATAGGTGAAGCTAATGGTGATGCCCACTCTrrAGCA^ 
GTTAAACGTTTGGrrAAGGTTATCAAGGATGTTGATAAGACTCGCTATGTTACCATGGGAGCAGATAAATTCCCTT 
TCGGTAATGGTAGCGGAGGGCATGAGAAAATTGCTGATGAACTCGATGCTGTTGGATTTAACTATTCTGAAGATA 

40 ATTACAAAGCCCTTAGAGCTAAGCATCCAAAATGGTTGATn^ATGGATCAGAAACATCTTCAGCTACCCGTACAC^ 
TGGAAGTTACTATCGCCCTGAACGTGAATTGAAACATAGCAATGGACCTGAGCGTAATTATGAACAGTCAGATTA 
TGGAAATGATCGTGTGGGTTGGGGGAAAACAGCAACCGCTTCATGGACTTTTGACCGTGACAACGCTGGCTATGC 
TGGACAGTTTA TCTG GACAGGTACGGACTATATrGGTGAACCTACACCATGGCACAACCAAAATCAAACTCCTGTT 
AAGAGCTCTTACTTTGGTATCGTAGATACAGCCGGCATTCCAAAACATGACTTCTATCTCTACCAAAGCCA^ 

45 TTTCTGTTAAGAAGAAACCGATGCTACACCTTCTTCCTCACTGGAACTGGGAAAACAAAGAAT^ 

TAGCTGACTCAGAAGG TAAGA TTCCAGTrCGTGCTTATTCGAATGCTTCTAGTGTAGAATTGTrCTT^ 
ATCTCTTCGTCTTAAGACmCAATAAAAAACAAACCAGCGATGG 

TGAACrrrATCTTGAATGGAAAGTrGCCTATCAACCAGGTACCTTGGAAGCAATTGCTCGTGATGA^ 
GAAATTGCTCCAGATA^GATTACGACTGCTGGTAAGCCAGCGGCAGTTCGTCTTATrAAGGAAGACOVTGCGATT 
50 GCAGCAGATGGAAAAGAGTrGACTTACATCTACTATGAAATTGTTGACAGCCAGGGGAATGTGGTrCCAACTG 

AATAATCTGGTTCCCrrCCAATTGCATGGCCAAGGTCAACTGGTCGGTGTACATAACGGAGAACAAGCCAGCCGT 

gaacgctataaggcgcaagcagatggttcttggattcgtaaagcatttaatggtaaaggtcttgccattgtca 
caactgaacaagcagggaaattcaccctg actgc ccacrctgatctcntgaaatcgaaccaagtcactg^ 
tggtaagaaagaaggacaagagaagactgttttggggacagaagtgccaaaagtacagaccattattggagagg 
55 cacctgaaatgcctaccactgttccgtttgtatacagtgatcgtagccctgcagaacctcctgtaacctggtcttc 
agtagatgtgagcaagcctggtattgtaacggtgaaaggtatggctgacggacgagaagtagaagctcgtgtaga 
agtgattgctct taaat cagagctaccagttgtgaaacgtattgctccaaatactgacttgaattctgtagaca^ 
tctgtttcctatgttttgattgatggaagtgrrgaagagtatgaagtggacaagtgggagattgccgaagaagata 

AAGCTAAGTTAGCAATTCCAGGTTCTCGTATTCAAGCGACCGGTTATTTAGAAGGTCAACCAATTCATGCAACCCT 
60 TGTGGTaGAAGAAGGCAATCCTGCGGCACCTGCAGTACCAACTGTAACGGTTGGTGGTGAGGCAGTAACAGGTCT 
TACTAGTCAAAAACCAATGCAATACCGCACTCTTGCTTATGGAGCTAAGTTGCCAGAAGTCACAGCAAGTGCTAA 
AAATGCaGCTGTTACAGTTCITCAAGCAAGCGCAGCAAACGGCATGCGTGCGAGCATCTTTATTCAGCCTAAAGA 
TGGTGGCCCTCTTCAAACCTATGCAATTCAATTCCTTGAAGAAGCGCCAAAAATTGCTCACTTGAG 
GAAAAAGCTGACAGTCTCAAAGAAGACCAAACTGTCAAATTGTCGGTTCGAGCTCACTATCAAGATGGAACGCAA 



30 



GCTGTATTACCAGCTGATAAACrrAACCrrCTCTACAAGTGGTGAAGGGGAACrrCGCAArrCCT 
GAGTrGCATAAGCCAGGAGCAGTCACTCTGAACGCTGAATATGAGGGAGCTAAAGACCAAGTTGA.A^ 
CAAGCCAATACTGAGAAGAAGArrGCGCAATCCATCCGTCCTGTAAATGTAGTGACAGATITGt^TC^ 
AGTCXrCCAGCAACAGTAACAGTrGAGTATGACAAAGGrrTCCCTAAAACTCATAAAGTCACTT^^ 
5 CGAAAGAAAAACTAGAGTCCTATCAAACATrrGAAGTACTAGGTAAAGTrGAAGGAATTGACCTTG^^ 

CAAAAGTCTCTGTAGAAGGTATCGTITCAGTTGAAGAAGTCAGTGTGACAACTCCAATCGCAGAAGCACCA^^ 
TACCAGAAAGTGTTCGGACATATGATTCAAATGGTCACGTTTCATCAGCTAAGGTTGCATGGGATGCGATTOT^ 
AGAGCAATACGCTAAGGAAGGTGTCrrTACAGTrAATGGTCGCTrAGAAGGTACGCAATTAACAACT^^ 
TGTTCGCGTATCTGCTCAAACTGAGCAAGGTGCAAACATITCrGACCAATGGACCCGTTCAGAATTGCCAC^ 

10 TTTGCTTCAGACTCAAATCCAAGCGACCCAGTrTCAAATGTTAATGACAAGCTCArrrCCTA 

CCAATCGTrGGACAAACTGGAATCGTACTAATCCAGAAGCTTCAGTCGGTGTTCTGTTTGGAGATTCAGGT^ 
GAGCAAACGCTCCGTTGATAATCTAAGTGTCGGATTCCATGAAGACCATCGAGTTGGTGTACCGAAGTCTTATGTG 
ATrGAGTATTATGrrGGTAAGACTGTCCCAACAGCTCCTAAAAACCCTAGTTTTGTTGGTAATGAGG 
TTAATGATrcrGCCAACTGGAAACCAGTTACTAATCrAAAAGCCCCTGCTCAACTCAAGGCTG^ 

15 ACrrrAGCrrrGATAAAGTTGAAACCTATGCTGTTCGTATrCGCATGGTTAAAGCAGATAACAAGCGTGG 

TATCACAGAGGTACAAATCTTTGCGAAACAAGTTGCGGCAGCCAAGCAAGGACAAACAAGAATCCAAGTTGACGG 
CAAAGACTTAGCAAACTTCAACCCTGATTTGACAGACTACTACCTTGAGTCTGTAGATGG^ 
ACAGCAAGTGrrAGCAACAATGGTCTCGCTACCGTCGTTCCAAGCGTTCGTGAAGGTGAGCCAGT TCGTG TCATCG 
CGAAAGCrGAAAj\TGGCGACATCrrTAGGAGAATACCGTCTGCACrrCACTAAGGATAAGAGCTTACT^ 

20 AACCAGTTGCTGCGGTTAAACAAGCTCGCTTGCTACAAGTAGGTCAAGCACTTGAArrGCCGACrAAGG^ 

TTACnrCACAGGTAAAGACCGCTACGAAACAAAAGACCTGACAGTTGAATGGGAAGAAGTTCCAGCGGAAAATCr 
GACAAAAGCAGGTCAATTTACTGTTCGAGGCCGTGTCCTTGGTAGTAACCrrGTTGCTGA 
ACAGACAAACTTGGTGAGACTCTTTCAGATAACCCTAACTATGATGAAAACACTAACCAGGCCTT^ 
ACCAATGATATTGACAAAAACTCTCATGACCGCGTTGACTATCTCAATGACGGA GATC ATrCAGAAAATCGTCGTT 

25 GGACAMCTGGTCACCAACACCATCTTO'AATCCAGAAGTATCAGCGGGTGTGATTITCCGTGAAAATGGTAA^ 

rrGTAGAACGGACTGTTACACAAGGAAAAGTTCAGTTCTTTGCAGATAGTGGTACGGATGCACCATCTAAACTCC^ 
TrrAGAACGCTATGTCGGTCCAGAGTTTGAAGTGCCAACCTACTATTCAAACTACCAAGCCTACGACGCAGACCAT 
CCATrCAACAATCCAGAAAATTGGGAAGCTGTTCCTTATCGTGCGGATAAAGACATTGCAGCTGGTGATGAAATC 
AACCTAACATTrAAAGCTATCAAAGCCAAAGCTATGAGATGGCGTATGGAGCGTAAAGCAGATAAGACCGGTGTT 

30 GCGATGATTGAGATGACCTTCCrrGCACCAAGTGAATTGCCTCAAGAAAGCACTCAATCAAAGATTCTTGTAGA^^ 
GAAAAGAACTTGCTGATTTCGCTGAAAATCGTCAAGACTATCAAATTACCTATAAAGCT^ 
CAGTTGAAGAAAACAATCAAGTAGCrrCAACTGTGGTAGATAGTGGAGAAGATAGCrrTTCCAG^^ 
CGTTTCAGAAAGTGGAAAACAAGTCAAGGAATACCGTATCCACTTGACTAAGGAA 

AAACCAGTTTCTGAGAAGACAGTTGCTGCTGTACAAGAAGATCTTCCAAAAATCGAATTTGTTGAAAAAGAT^ 
35 GCATACAAGACAGTTGAGAAAAAAGATTCAACACTGTATCTAGGTGAAACTCGTGTAGAACAAGAAGGAAAA 

GGAAAAGAACGTATCTTTACAGCGATTAATCCTGATGGAAGTAAGGAAGAAAAACTCCGTGAAGTGGTAGAAGTT 
CCGACAGACCGCATCGTCTTGGTTGGAACCAAACCAGTAGCTCAAGAAGCTAAAAAACCACAAGTGTCAGAAAAA 
GCAGATACAAAACCAArrGATTCAAGTGAAGCTAGTCAAACTAATAAAGCCCAGTTACCAAGTACAGGTAGTGCG 
GCAAGCCAAGCAGCAGTAGCAGCAGGTTTAACTCTTCTAGGTTTGAGTGCAGGATTAGTAGTTACTAAAGOT 

40 AAAGAAGACTAG 
410L.5 

ATGGATGCAATCrrrGACCTAATCGGAAAGGTTTTCAATCCCATCTTAGAAATGGGTGGACCTGTCATC^ 

45 TCATTTTGACAGTATrGGCTTTACTTTTTGGAGTGAAATTCTCC AAAGC GOT 

CGCTCTTACAGGTATCGGTGCTATCATCGGTATGCTAAACACTGCTTTCTCAGCATCACTAGCAAAATTCGTTGA^ 
AACACTGGTATCCAATTGAGTATTACCGACGrTGGTTGGGCACCACTTGCTACAATCACTTGGGGTTCTGC^ 
CACTATACTTCrrGCTCATCATGTTGATTGTCAACATAGTGATGCTAGCTATGAAGAAAACAGATACACTTGATGT 
CGATATCTTTGATATCTGGCACTTGTCTATCACAGGTCTCTTGATTAAATGGTATGCTGATAACAATGGTGTG^ 

50 CAAGGGGTTTCACTCrrTATTGCTACAGCAGCTATCGTCCTrGTCGGTGTGrrGAAAATTATCAACT 

GAAACCTACATTTGATGACCTTCTTAACGCCCCAAGTTCATCACCAATGACATCAACTCACATG AACTA C^^ 
AACCCAGTTATCATGGTTTTGGATAAGATTrrTGAAAAATTCTTCCCAGGCCrrGATAAAT^ 
CTAAATTGAACAAGAAAATCGGTTTCTGGGGATCTAAATTCTTCATCGGrrrCATCCTTGGTATCGT^ 
ATGGGAACrCCACATCCAATTGCAGCTGTTGCAGATGCAGATAAATGGCGTCTITGTTATCAAAGGATGGTTGTCTC 

55 TTGGTITGACTGCCGGTGTATCTTTGGAACTCTTCTCACTTATCGGTTCATGGTTCATCGCAGCCGTAGAACC^ 

TCACAAGCTATTACAAACGTTGCTACTAAACGTCTrCAAGGACGTAAATTCAATATCGGTCTTGACTGGC CA^^ 
TCGCTGGTCGTGCTGAAATCTGGGCTTGTGCCAACGTACTTGCACCAATCATGTTGATTGAAGCAGTGci iCi i iC 
AAAAGTTGGAAATGGTATCTTGCCACTTGCAGGTATCATCGCTATGGGTGTTACTCCAGCTCTCTTGGTTGTAACT 
CGTGGTAAATTGCTCCGTATGATTATCTTCGGAACACTCTTGTTGCCACTCnnrCCTTCTTTCAG 

60 ACCATTTGCAACAGAACrTGCTAAAGGTGTAGGTGCCTTCCCAGAAGGTGTGAGCCAAACTCAATTGATTACT^ 
TCTACTCTTGAAGGACCAATCGAAAAACrrCTTGGTTGGACAATTGGTAACACTACAACTGGTGATATCAi^ 
TCCITGGTGCAGTAGTCTTCCTTGTATTCTATATCGGTATCTTTGCTTGGTACAGAAAACAAATGATCAAA 
CGAAGAGTACGCAGCAAAAGCAAAATAA 

65 4102.1 
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ATGAACATTATGAAAAAAAAATATTGGACTTrAGCGATArrATTCTTTTGTT^ 
AGAAATACCTAAAAATCTTGATGGCAATATAACTCACACrCAGACTAGCGAAAGTTTr^ 
ACAGGTTGACTATTCTAATAAAAATCAAGAAGAAGTAGACCAAAATAAATTTCGTATTC/^ 
ATTATTTGTAACAACAGATAAACATTTAGAAAAAAACTGTTGTAAATTGGAACTTGA^ 
5 TArrGTTAACrCTGAAAGTAATAATTTACTAGGCGAAGATAATrrAGATAATAAAATTAAGGA^^ 

CTAGATAATAGAGGAGGAAATATAGAGCATGACAAAGATAACTTAGAATCGTCGATTGTAAGAAAATATGAATGG 

GATATAGATAAAGTTACTGGTGGAGGCGAAAGTTATAAATTATATTCTAAAAGTAATrCTAAAGTT^ 

TTTTAGATTCAGGAGTCGATTTACAAAATACrGGATTACTGAAAAATCTTTCAAATCACT^ 

CAATAAAGGATATTTAGGAAAAGAGGAGGGAGAGGAAGGAATAATATCAGATATTCAAGATAGATTAGGTCATG 
10 GTACGGCTGTTGTAGCTCAAATTGTAGGGGATGACAATATTAATGGAGT AAATCC TCACGTTAATATTAACGTCT^ 
TAGAATATTTGGTAAGTCGTCAGCTAGTCCAGATTGGATTGTAAAAGCAATTTITGATGCT 
GATATTATCAATCTTACTACTGGACAATATTTAATGATTGATGGAGAATATGAGGACGGAACAAATGATTT^ 
CATTTTTGAAGTATAAAAAGGCTATTGATTACGCGAATCAAAAAGGAGTAATTATAGTAGCTGCArrAGGG^^ 
ACTCCCTAAATGTATCAAATCAGTCAGATTTATTGAAACTTATTAGTTCACGCAAAAAAGTAAG/^^ 

1 5 AGTAGTTG ATGTTCC AAGTTATTTCTCATCT ACAATTTCGCTCGG AGGC ATAG ATCGCTTAG 

GTAATTTATCAGATTTTAGCAATAAAGGGGATTCTGATGCAATATATGCGCCT GCAGGCT CAACATTATCT 
AGAATTAGGACTTAATAACriTATTAATGCAGAAAAATATAAAGAAGATTGGATTTm 
ATATACGTATCTTTATGGAAACTCATTTGCTGCTCCTAAAGTTrcrrGGTGCGATT^ 
AATTAAAAGATCAGCCCTATAATTATATGTTTGTAAAAAAATTCTGGAAGAAACATTACCAGT^ 

20 

4106. i 

ATGAAGAAAACATGGAAAGTG ni 1 1 AACGCTTGTAACAGCrCrTGTAGCTGTTGTGCTTGTGGCCTGTGGTCAAG 

GAACTGCTTCTAAAGACAACAAAGAGGCAGAACTTAAGAAGGTTGACTTTATCCTAGACTGGACACCA^ 

ACCACACAGGGCTTrATGTTGCCAAGGAAAAAGGTTATITCAAAGAAGCTGGAGTGGATGrrGATTTG/^^ 

25 CACCAGAAGAAAGTTGTTCTGACTTGGTrATCAACGGAAAGGCACCATTTGCAGTGTATTrCCA^ 

TAAGAAATTGGAAAAAGGAGCAGGAATCACTGCCGTTGCAGCTATTGTTGAACACAATACATCAGGAATCATCTC 
TCGTAAATCTGATAATGTAAGCAGTCCAAAAGACTTGGTTGGTAAGAAATATGGGACATGGAATGACCCAACTGA 
ACTTGCTATGTTGAAAACCTTGGTAGAATCTCAAGGTGGAGACTTTGAGAAGGTTGAAAAACT 
CTCAAACTCAATCACACCGATTGCCAATGGCGTCTTTGATACTGCTTGGATTTACTACGGTTGGG^ 

30 GCTAAATCTCAAGGTGTAGATGCTAACTTCATGTACTTGAAAGACTATGTCAAGGAGTTTGACTA 

TTATCATCGCAAACAACGACTATCTGAAAGATAACAAAGAAGAAGCTCGCAAAGTCATCCAAGCCATCAAAAAAG 
GCTACCAATATGCCATGGAACATCCAGAAGAAGCTGCAGATATTCTCATCAAGAATGCACCTGAACrCAAGGAAA 
AACGTGACTTTGTCATCCAATCTCAAAAATACTTGTCAAAAGAATACGCAAGCGACAAGGAAAA^ 
TTGACGCAGCTCGCTGGAATGCrTTCTACAAATGGGATAAAGAAAATGGTATCCTTAAAGAAGACT^ 

3 5 AAGGCTTCACCAACG AATTTGTG AAATAA 

4106.4 

ATGATAAAAAATCCTAAATTATTAACCAAGTUl I U 1 i'AAGAAGTTTTGCAATTCTAGGTGGTGTTGGTCTAGTCA 
rrCATATAGCTATTTATTTGACCTTTCCri 1 1 1 ATTATATTCAACTGGAGGGGGAAAAGTTTAATGAGAGCGCAAG 

40 agtgtttacggagtatrtaaagactaagacatctgatgaaattccaagcttactccagtcttatta^ 

accatatctgctcaccttaaaagagatattgtagataagcggctccctcttgtgcatgacttggatattaaagatc 
gaaagctatcaaattatatcgtgatgttagatatgtct gttag tacagcagatggtaaacagg taaccg tgcaatt 

TGTTCACGGGGTGGATGTCTACAAAGAAGCAAAGAATATrrrGCTTTTGTATCTCCCATATACATm 
ATTG Cl - nTl CCTTTG ni 1 llCl i A' ri ' l ' ri ATACTAAACGCrrGCTCAATCCT L -r i - IinUA CATTTCAGAAGTGACT 

45 AGTAAAATGCAAGATTTGGATGACAATATrCGTTTTGATGAAAGTAGGAAAGATGAAGTTGGTGAAGTTG 

CAGATTAATGGTATGTATGAGCACTrGTTGAAGGTTATTTATGAGTTGGAAAGTCGTAATGAGCAAATTGTAA^ 
TGCAAAATCAAAAGGTTTCCTTTGTCCGCGGAGCATCACATGAGTTGAAAACCCCrrrAGCCAGTCT^ 
CCTAGAGAATATGCAGCATAATATTGGACArTACAAAGATCATCCAAAATATATTGCAAAGAGTATAAATAAGAT 
TGACCAGATGAGCCACTTATTAGAAGAAGTACTGGAGTCTTCTAAATTCCAAGAGTGGACAGAGTGTCGTGAGAC 

50 CTTGACTGTTAAGCCA Ol 1 i U GTAGATATTrTATCACGTTATCAAGAATTAGCTCATTCAATAGGTGTTACAATTG 

AAAATCAATTGACAGATGCTACCAGGGTCGTCATGAGTCTTAGGGCATTGGATAAG Oi 1 1 i GACAAACCTG ATTA 
GTAATGCAATTAAATATTCAGATAAAAATGGGCGTGTAATCATATCCGAGCAAGATGGCTATCTCTCTATCAAAA 
ATACATGTGCGCCTCTAAGTGACCAAGAACTAGAACATTTATTTGATATATTCTATCATTCTCAAATCGTGA CAGA 
TAAGGATGAAAGTTCCGGTTTGGGTCnTrACATTGTGAATAATATTrrAGAAAGCrrATCAA^ 

55 CTCCCTTATGAACACGGTATGGAATTTAAGATTAGCTTGTAG 

4106-6 

atgtatttaggagatttgatcgagaaagccgagtgtgctcaattttcaatact^ 

agacgaccgtcaaggctgtaatggaagaaacaggattttcaaaagcaaccctaaccaaatatgtcaccctgctca 
60 atgacaaggctttggatagtggcttagagct ggcta rrcactcagaagatgaaaatctgcgtctgtctatcggtgc 
agctaccaaggggagagatattcggagcttgtttttggagagtgctgttaaataccagattttggt^ 
taccaccaacaglun'lu'agcccatcagctggctcaagaattggtgattaccgaggctacgcttggtcgtcacttgg 
ctggttraaatcagattrrgtcagaatttgatttatccatccaaaatggccgttg^ 
tcactatttctatttctgtcttttccgaaaggtctggtcgagtcaggaatgcgaaggtcacato 
65 agaaaacaggagattgccaatttagaggaaatctgcggtgcaagtttctctgcggggcagaaattggacttgct^ 
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CTCTGGGCTCACATCAGTCAACAACGTCTTCGGGTCAATGCTTGTCACrriTCAACT^ 

GGGTATTTTGACAATATCrriTATCnTCGTTTGCTCAGAAAGGTTCCGTCCT^ 

AGGAGTTGAGGATGGTGAGATGATGATATTCTrCTCTTTTCTCCTATCTCATCGCATTOT 

AGTATATTCTrGGTTTrGGAGGGCAGTTGGCAGATTTACTGACGCAATTGATTCAAGAAATGAAGAAGGAGG^ 
5 TATTGGGGGATTATACAGAGGACCATGTCACCTATGAACrCAGTCAGCTTTGTGCrCAAGTCTATCTCT ATAA ^ 
CTATATrrTACAGGATCGCTACAAGTACCAGTTAGAGAATCGTCATCCATATTTACTGATGGAACATGATm 
GAGACAGCAGAGGAGATTTITCATGCTCTACCTGCTTTTCAACAGGGGACAGATTTAGATAAGAA 
AATGGCTCCAGrrAATCGAATATATGGCTGAAAACGGTGGCCAGCATATGCGGATrGGTCTGGATrTGACATCTCG 
TTTTGTrGTCTTTTCAAGGATGGCAGCCATTTTGAAACGGTATTTGGAATAC^ 
10 ATGACCCTAGTCGGCATTATGATTrGCTGGTTACCAATAACCCGATTCATAAGAAGGAACAGACACCAGTCrAT^^ 
TTTAAAAAATGACTTGGATATGGAGGATTTGGTAGCGATTCGCCAGTTATTATTCACrr^ 

4106.7 

ATGGAATTTTCAAAGAAAACACGTGAATTGTCAATTAAAAAAATGCAGGAACGTACCCTGGACCTCT^ 
15 GGTGGAGGAATCACAGGAGCTGGTGTAGCCTTGCAGGCGGCAGCrAGCGGTCTTGAGACTGGTTTGATrGAA^ 
CAAGACTrrCCAGAAGGAACATCTAGTCGTTCAACAAAATTGGTTCACGGAGGACTTCGTTACCrC^ 
ACGTAGAAGTGGTCTCAGATACGGTTTCTGAACGTGCAGTGGTTCAACAAATCGCrCCACACATTCCAAAATCAG 
ATCCAATGCTCTTACCAGTTrACGATGAAGATGGAGCAACCTTTAGCCTCTTCCGTCTT 

GTACGACCTCTTGGCAGGTGTTACCAACACACCAGCTGCGAACAAGGTTTTGAGCAAGGATCAAGTCTTGG^ 

20 CCAGCCAAACnTGAAGAAGGAAGGCTTGGTAGGAGGTGGAGTGTATCTTGACTTCCGTAACAACGATGCGCCT 

CGTGATTGAAAACATCAAACGTGCCAACCAAGACGGTGCCCTCATrGCCAACCACGTGAAGGCAGAAGGCTTCCr 
CriTGACGAAAGTGGCAAGATTACAGGTGTTGTAGCTCGTGATCTCTrGACAGACCAAGTGTTTGAi^ 
CGTCTGGTTATTAATACAACAGGTCCTTGGAGTGATAAAGTACGTAATTTGTCTAATAAGGGAACGCAArrCT 
AAATGCGCCCAACTAAGGGAGTTCACTTGGTAGTAGATTCAAGCAAAATCAAGGTrrCACA 

25 ACACAGGTTTGCGTGACGGTCGTATGGTCTTTGTTCTCCCACGTGAAAACAAGACTTACTTTGGTACAACT 

AGACTAdACAGGTGATTTGGAGCATCCAAAAGTAACTCAAGAAGATGTAGATTATCTACTTGGCATTGTC^ 

CCGCTTCCCAGAATCCAACATCACCATTGATGATATCGAAAGCAGCTGGGCAGGTCTTCG 

TCCATTGATrGCAGGGAACAGTGCCTCTGACTATAATGGTGGAAATAACGGTACCATCAGTGATGAAAGCTT^ 
CAACrrGArrGCGACTGTTGAATCTTATCTCTCCAAAGAAAAAACACGTGAAGATGT^ 

30 OTGAAAGTAGCACATCTGAGAAACAriTGGATCCATCrGCAGTTTCTCGTGGGTCTAGOT 

ATGGTCTCTTGACTCTrGCTGGTGGTAAAATCACAGACTACCGTAAGATGGCTGAAGGAGCTATGGAGCGCGTGG 
TTGACATCCTCAAAGCAGAATTTGACCGTAGCTITAAATTGATCAATrcrrAAAACTTACCCT 
ATTGAACCCAGCAAATGTGGATTCAGAAATCGAAGCCnTTGCGCAACTTGGAGTATCACGTGGT^ 
GGAAGCTCACTATCTGGCAAATCTTTACGGTTCAAATGCACCGAAAGTCTTTGC\CT^ 

35 GCGCCAGGACTCAGCTTGGCAGATACTTTGTCCCnTCACTATGCAATGCGCAATGAGTTGACTCrrAG C^ 

ACTTC CrrL - 1 1 CGTCCTACCAATCACATGCTCTTTATGCGTGATAGCTTGGATAGTATCGTTGAGCCAATTTTGGAT 
GAAATGGGACGATTCTATGACTGGACAGAAGAAGAAAAAGCAACTTACCCTGCTGATGTCGAAGCAGCTCrCGCT 
AACAACGATTTAGCAGAATTAAAAAATTAA 

40 4106.8 

ATGATGAATGAATTATTTGGAGAATTTCTAGGGACTTTAATCCTGATTCTTCTAGGAAAT^ 
TGGTTCTTCCTAAAACCAAGAGCAATAGCTCAGGTTGGATTGTGATTACTATGGGTTGGGGGATTGCAGrrGCG 
TGCAGTGTTTGTATCTGGCAAGCTCAGTCCAGCTTATTTAAACCCAGCTCTGACCATCGGTGTGGCCTT^ 
GGTlTGCCTTGGGCTTCCGTTTTGCCTTATATCTrAGCCCAGTTCGCAGGGGCCATGCTGGGTCAGAT^^ 

45 GTTGCAATTCAAACCTCACTATGAGGCAGAAGAAAATGCAGGCAATATCCTGGCAACCTTCAGTA CTGGACCAG C 
CATCAAGGATACTGTATCAAACTTGATTAGCGAAATCCTTGGAACTTTTGTTrTGGTO 
GTCrTTACGACTTTCAGGCAGGTATCGGAACCTTTGCAGTGGGAACTTrGArTGTCGG 
TGGGACAACAGGTTATGCCTTGAACCCAGCTCGTGACCTTGGACCTCGTATCATGCACAGCATCn'GCCAA 
AACAAGGGAGACGGAGACTGGTCTTACGCrrGGATTCCTGTTGTAGGCCCTGTTATCGGAGCAGCCTTGGCAGTG 

50 CTTGTATTCTCACTTTTCTAG 

4106.10 

ATGAAAAGGACCTGGAGGAACTCATTCGTGACAAATCTTAATACACClU"i"IATGATTGGCAATATTGAGATTCCCA 
ATCGTACCGTTTTAGCGCCTATGGCTGGCGTGACCAACTCAGCCrrrCGTACTATCGCAAAGGAGCTCGGAGCTC^ 

55 ACrCGTTGTAATGGAAATGGTCTCrGACAAGGGAATCCAATACAACAACGAAAAAACCCTGCACATGC^ 

CGATGAGGGCGAAAACCCTGTCrCrATCCAACTTTTTGGTAGCGATGAAGACAGCCTAGCACGCGCAGCAGAATr 
CATCCAAGAAAACACCAAGACCGATATCGTCGATATCAACATGGGCTGCCCTGTCAACAAAATCGTGAAGAACGA 
AGCTGGTGCTATGTGGCTCAAGGATCCAGACAAGATTTACTCCATCATCAACAAGGTCCAGTCTGTCCTTGATATC 
CCACTTACTGTCAAAATGCGTACCGGCTGGGCGGACCCATCTCnTGCAGTAGAAAATGCrCTCGCTGCTGAAG^^ 

60 CAGGTGTTTCTGCCCTCGCCATGCATGGCCGTACCCGTGAACAAATGTATACTGGCCACGCAGACCTTGAGACCCT 
TTACAAGGTTGCCCAAGCTCTAACCAAGATTCCATTCATCGCCAACGGTGATATCCGTACTGTCCAAGAAGCC^ 
GCAACGCATCGAAGAAGTTGGTGCTGACCCAGTCATGATTGGCCGAGCTGCCATGGGAAATCCTTACCTCTTCAA 
CCAAATCAACCATTACTTrGAAACAGGAGAAATCCTACCTGATTTGACCTTTGAAGACAAGATGAAGAT^^ 
CGAACACTTGAAACGATTGATTAACCTCAAAGGAGAAAACGTCGCAGTTCGTGAATTCCGCGGTCTCGCTCCrCA 
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CTATCTCCGTGGAACATCTGGCGCTCCCAAACTCCGTGGAGCCATTTCCCAAGCCAGCACCCTGGCAGAGATTGA 
AACCCTCTTGCAATTGGAGAAGGCTTAA 



5 

4107.1 

ATGACAAAGAAGAAAATTGAGCGTATrTCTCTAATACACCGAGAAAAGATTTTATGGCTCAAGTGGTAT^ 
GAGATAAAGAACAACCTAAGTATAGTGTCCTTGAGCGTAAAATGTTrGATGCTGCTAAAAATCAAG ATATG CTA^ 
10 cnTATCAAAAATACGCAACTATCAAGCAGATAACAGATATTAGGGTACAAACAAGTGAGGCTGACATm 
CTGTAAAAGAGGrrrATGTGTACAATCACATGAATGTTATCGGAGCTTGTCAGCGGATATTATTTATCAC^^ 
ACCAGCTTATGATAAGTTAAATAAGTGGTTTAATATCTATTCTGATTTGTATTT^ 
TGGGGGTATATCATGAGATGGTAGGTATCTAG 

15 4107,2 

ATGAAAAATTCCAACGAGGCTGAGATGAAATTACTTTATACTGATATTCGGAC I 1 C i'l 'rGACAG AAAT TCTAACAA 
GAGAGGCAGAAGAGCTAGTTGCAGCTGGCAAGCGGGTCTTCTACATTGCCCCCAACTCTCi'i'lCl 1 1 IGAAAAGGA 
ACGCGCCGTGCTGG AATACTTGTCCCAGCAGGCi'i C I ' I' J T 1 CG ATTACCGTCACGCGCTTTGCTCAAATGGCTCGC 
TATCTGGTCTTGAATGATTrACCAGCTAAAACTACTCTTGATGATATCGGTCTTGGGTTGGCCr^^ 

20 TGCCGAACTCGATCCCAAGGACTTGCGTGTTTATG GCGCTA TTAAGCAGGATCCTCAATrGATCCAGCAGTTAAT^ 
GAGCTTTACCATGAGATGACCAAATCTCAGATGAGi i i i 1 iGGACTTGGAGAATTTAACAGATGAGGATAAGAGG 
GCGGATTTACTCrTGAi i i 1 1 GAGAAAGTAACAGCCTATCTTAATCAAGGTCAGTTAGCCCAGGAAAGTCAGTTGT 
CCCATTTGATTGAGGCTATTGAGAATGACAAGGTAAGTAGTGATmAATCAAATCGCCTTCGTCATTGACGG 
TACTCG I 1 i i I CTGCTGAGGAAGAGCGGGTTGTGGACnTACTTCACGGCAAAGGTGTrGAGATTGTTATCGGGGCT 

25 TATGCTA<7rAAGAAAGCCTATACCAGTCCi'l'ilAGCGAGGGCAATCTCTACCAAGCCAGCGTAAAATITCT^ 
ATCTGGCTTCTAAATACCAAACGCCTGCTCAGGACTGTTCTCAAACTCATGAGAAGATGGATAGTm 
CTCTCGrrrGTTGGAGTCTTCTTATGACTTTTCAGAACTCGCTTTGGATGTC^ 

CAAATCTCGTCTTGTTTGACGCAAAJ^GGAGGAGTTGGAGCTAGTAGCCCGTAGTATTCGTCAGAAATTACATG^ 
AACrCAGACCTGAGCTACAAGCATTTTCGTATTCTCTrGGGGGATGTAGCTTCTTACCA 

30 ITl i I GACCAGTATCAGATrC Ci i ' l ' ll 'ATCTTGGTAGAAGCGAAGCCATGGCTCATCATCCCTTGACTCAGTn'CTC 

GAGTCTATTTTAGCTTTAAAACGTTACC U I CGTCAGGAGGATTTGATTAAT CrrClT AGAACTGATTTG TATAC 
TGACCTCAGTCAGTCTGATATTGATGCTTTTGAGCAATATATCCGCTATCTTGGTATCAATGGCrrGCCAG^ 
AGCAAACCTTCACCAAATCCCACCATGGAAAATrTAATCTTGAGCGTTTGAATGTCCTCCGCCTGA GA^ 
ACCTCTTGAAACCCTCTTTGCCAGCCGAAAACAAAAGGCTGAAAAACTCCTACAAAJ^ 

35 AGAAGGAGCTGTGACCAAGCAGTTACAACATTTGACAACCACrrTGGAAGCTGTAGAACAGGAAAGACAAGCCG 
AAGTTTGGAAGGCTTTCTGCCAT U ' rri - I 'AGAACAArrrGCGACTG i 1 i i I GCTGGTTCACAGGTTAGTCrGGAAGA 
CTrCCTAGCCTTGCTCCATTCTGGAATGAGTTTGTCCCAATACCGTACCATTCCAGCAACAGTGGACACTCTT 
GTGCAGAGTTACGATTTGATTGCACCATTGACTGCTGACTTTGTCTATGCTATTGGACTAACT^ 
CAAAAATTTCTCAAAACACCAGTCTTCTGACAGATGAAGAAAGGCAAAACCTAAACCAAGCGACCGAAGAAGGC 

40 GTTCAATTACTGATTGCCAGCAGTGAAAATCTCAAGAAAAATCGCTACACTATGCTTTCCTT 

GTAAGCAGTrGTTCTTGTCGGCTCCAAGCCi 1 1 1 1 AACGAAAGTGAAAGTAAGGAATCTGCCrATCTTCAAGAGTT 
GATCCATTTTGGATTTAGGCGGAGAGAGAAGAGGATGAATCACAAAGGACTGTCTAAGGAGGATATGGGGTCCTA 
TCAC AGTC r i ■ i ■ i GTCTAGTCTGGTTGCCT ATC ACCAGCAGGGTG AG ATG AG CG AT ACTG AGC AAG ATTTG ACTTTT 
GTCAAGGTTCrGTCGCGTGTCATAGGTAAAAAACTAGATCAGCAAGGTCTGGAAAATCCAGCTATCCCAACCAGT 

45 CCAAGCAGCAAGACCTTAGCCAAGGACACCTTGCAAGCTCT 

CTATCCTGCC AAAC AGGAGTTrrACCTGTCTACGTCGGGTTTGACAGAGTTTTATCGa^ 
CTACGCTACGrrrrAGGCTTGCAGGAGGAATTACGTTTGCATCCTGATGCCCGTAGTCACGGGAATTTCT^ 
GTATCTTTGAACGCGCCTTACAGrrGCCTAATGAAGATTCCTTTGACCAACGTCTAGAACA^ 
CAGTCAAGAACGCGAATTTGAAGCTATTTATCAAGAAAGTTTGGAAGCCCAGTTTACCAAGGAAGTT^ 

50 GTTGCACGGACAAC TGGA CATATTCTCCGACACAATCCAGCCATCGAAACCATCAAAGAAGAAGCAAATrrTGGT 
GGAAAAGACCAAGCCTTTATTCAATTAGACAATGGACGCAGTGTCTTTGTACGAGGCAAGGTGGACCGGATTGAC 
CGTTTGAAAGCTAATGGAGCGATAGGAGTAGTAGACTACAAATCCAGTCTGACTCAGTTCCAGTTTCCTCATTTCT 
TTAATGGGCTCAATTCTCAGTTACCAACCTATCTTGCTGCCCTAAAAAGAGAAGGGGAGCAGAACn"l'l"l'l 
CATGTACTTGGAAATGGCTGAACCTGTCCAATCTCTGATGGCGGTAAAAAGTCTGGCAGGAGCAGTGGTAGAAGC 

55 CAGCAAATCTATGAAATACCAAGGGCTCn CI IGGAAAAAGAAAGCAGTTATTTAGGCGAATTTTATAACAAAAA 

CAAGGCT AATC AACTGACAGATGAGGAATTTCAGCTCCTACTGGACTACAATGCCTATCrrTACAAGAAAGCT 
GAGAAGATTTTAGCAGGCCGGTTCGCCATCAATCCTTATACTGAAAATGGCAGAAGCATTGCCCCATACGTCCAG 

caacatcaggctattacaggctttgaagccaattaccatctgggccaagcccgtttcctagaaaagttggacct 
ctgatggcaagcgtctggtcggagaaaaactcaagcaagcttggcttgaaaaaataa 
60 gagaggagttgaatcgatga 

4107.3 

ATGAAGCTTATTCCClU'rriAAGTGAGGAGGAGATTCAAAAACTGCAAGAAGCAGAAGCAAATTCGAGCAAGGAA 
GAG AAG AAAACTGTCG AGCAAATCG A AGCT ATCTACACTTCTGCCC AG AATATCCTGGTCTCAGCATCGGCTGGT 
6:> TCTGGAAAGACCrrTGTCATGGCAGAGCGCATTCTGGACCAATTGGCGCGTGGTGTCGAAATTTCTCAACTCTTTA 
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TCrCAACCrrTACCGTCAAGGCTGCAACTGAACrrAAAGAACGTTTAGAGAAAAAAAT^ 
AAACAGATGATGTCGACCTCAAACAACACTTGGGTCGCCAGTTGGCAGACCTACCCAACGCTGCC>rr^^ 
TGGATTCTrTCACACAAAAArrCCTTGGCAAACATGGTTATCTGCTTGATATTGC^^ 
AACCAAAGCGAGCAACTTATTCTCGAAAACGAAGTCTTTCATGAGGTCTn-GAAGCGCAT^^ 
5 AAAGAGACCTrrAGTCATTTGCTGAAAAACnTTGCTGGGCGTGGCAAGGACGAACGG GGTCT ^ 
TATAAAATCTATGACTTCCTCCAATCCACCAGTAATCCTCAAAAGTGGCrGAGTGAA TCT^ 
AGAAAGCrGATTTTACCAGTGAAAAAGAAAAACTGACCGAGCAAATCAAACAAGCCCm 
TTTTCCGTTACCATCTGGATAACGATGCCAAGGAGTTTGCAAAGGCTGCCTATrTAGAAAATGW 
GGATGAAATTGGCTCCCTAAATCAGGAGTCCGATAGTCAGGCTTATCAGGCAGTGCTTGCGCGTGTTGTCGCCATC 

10 TCTAAGGAGAAAAACGGTCGAGGTCTGACTAATGCCAGCCGTAAGGCTGArrrGAAGCCCCTGGCTGATG 

AJ^CGAAGAGAGAAAGACCCAGTTTGCTAAACTAGGACAATTATCAGACCAGATAGCGAT 

TCTCGACTATCAAGAACGTTATCATGGAGACACTTCGAAACTAGCTAAAACCTTCCAATC^ 

CTAGAGGCTTATCGTCAGAGAAAACGACAGGAAAATGCCTTCGAATTCGCTGATATCAGCCATrACACCATTGAG 

ArmAGAGAATrrCCCACAAGTTCGTGAGTCrrATCAGGAGCGCnTCCATGAAGTCi^^ 

15 ATACCAACCATATTCAAGAACGGATGCTGGAATTGTTGTCTAATGGCCACAATCGCTTTATGGTGGGAGATATCAA 
GCAATCCATCTATCCTITCCGTCAGGCAGACCCGCAGATTrrCAATGAGAAATTCCAACGCTATGCGCA^ 
CAAGAAGGCAGGCTCATTATCCTCAAGGAAAATTTCCGTAGTAGTTCAGAAGTGCTGTCAG CAACC AATGATGTC 
rrrGAACGTCTCATGGACCAAGAGGTCGGCGAAATCAACTATGATAACAAGCACCAGCTTGi 1 i i iGCCAATACCA 
AACTGACTCCCAATCC^GACAACAAGGCAGCATTTCTCCTCTACGACAAGGACGATACAGGTGAGG^ 

20 GTCAAACAGAAACGAAACTAACAGGCGAAATGCGCTTAGTTATCAAGGAGATTCTGAAACTTCATCAAGAA^ 
GTGrrGCCTTTAAGGAAATTGCCCTTCTGACCTCCAGCCGCAGTCGTAATGACCAGATTCTCCTCGCC 
r.TArnnAATTCCTGTCAAAACTGACGGAGAGCAAAACAATTATCTCCAATCCCTAGAAGTGCAAGTCATG CTAGA 
CACTCTTCGTGTCATTCACAATCCCCTGCAAGACTACGCCTTGGTTGCCCTTATGAAGTCTCCAATGT^ 
ATGAGGATGAGCTAGCACGTTTGTCCCTTCAGAAAGCAGAGGATAAAGTCCACGAAAATCrCTATGAGAAACTGG 

25 TCAATGCACAAAAAATGGCAAGTAGTCAAAAAGGCTTGATTCACACAGCTCTAGCTG 

AAAAACTAAAGCAATTCATGGATATCCTAGCTTCnTGGCCCTTGTATGCCAAAACCCACTCT 
TTGGAAGATTTACAACGACCGTnTrATTATGACTATGTTGGGGCrrrGCCGAATGGTCCT^ 
CTCTATGCCCTAGCACTGCGTGCTGATCAATTTGAAAAGAGCAATrrCAAAGGTTTGTCGCGTT^ 
TTGACCAAGTCTTAGAAGCCCAGCACGATTTGGCAAGCGTGGCCGTCGCACCGCCAAAAGATGCAGTAGAGCrCA 

30 TGACCATCCACAAGAGTAAAGGGCTGGAGTTTCCTTACGTCTTTATCCTCAATATGGATCAAGATITC^ 

AGACTCTATGTCAGAAGTCATTCTCAGTCGTCAGAATGGTCTTGGTGTCAAATATATTGCCAAGATGGAGACAGGG 
GCAGTAGAAGACCACTATCCTAAAACCATCAAACTCTCCATTCCTAGTCTGACCTATAGGCAGAACGAAGAGGAA 
TTACAGCTAGCAAGCTATTCTGAGCAGATGCGTTrGCTGTATGTTGCTATGACGCGGGCTGAGAAAAAGCTCTATC 
TTGTCGGCAAGGGTTCrCGTGAAAAGCTGGAATCCAACGAATACCCAGCAGCCAAAAATGGGAAACTAAATAGCA 

35 ATACTAGACTGCAAGCACGGAATTTCCAAGATTGGCTTTGGGCTATCAGTAAAGTGTTTACTAAGG 

ACTTTAGTTATCGTTTTATTGGCGAAGATCAGTTGACCAGAGAAGCTATCGGAGAGTTGGAAACCAAGAGTCCTCT 
CCAAGATAGCTCCCAAGCAGACAATCGTCAGTCAGATACCATCAAAGAAGCTCTGGAAATGCTGAAGGAGGTGGA 
AGTrrATAATACTCXrCACCGCGCAGCTATrGAACrTCCTAGTGTTCAAACCCCAAGTCAAATCAAGAAATT 
GAACCAGTTATGGATATGGAAGGTGTCGAGArrGCTGGTCAAGGTCAGTCAGTAGGCAAGAAAATCAGCrrCGAT 

40 TTGCCAGATTTTTCAACCAAAGAAAAGGTAACTGGAGCTCAGATTGGTAGTGCTACTCACG 

ATTGACCTCAGCCAGCAACTAACCCTTGCTAGCCTAACAGAAACACTCAAACAAGTTCAAACTAGCCAAGCT^ 
AG AG AC AAG ATCAATCTTG ATAAAATTCTTGC I T I'C IT i G AC ACAGT ACTCGGTCAGGAAA TTCTT GCTAATACCG 
ACCATCmATCGCGAGCAACCTTTCTCCATGCrCAAACGAGACCAAAAGAGTCAGGAAGACTTTGTTGTCCGTGG 
TATCCTTGATGGCrATCTGCTTTACGAAAJVCAAAATTGTrCTGTTCGACrACAAGACAGACCGCTATGATGAACCA 

45 AGTCAACTCGTAGACCGCTATCGTGGTCAGTTAGCrCTATACGAAGAGGCrrTATCACGAGCCTATTCGATTGAAA 
ATATTGAAAAATACTTGATTTTACTCGGTAAAGACGAGGTTCAAGTTGTAAAAGTATAA 

4109.1 

ATGGAACITGCTCGCCATGCTGAAAGCTTGGGAGTAGATGCTATTGCAACGArrCCACCAATTTATrrCCGCr^ 
50 CAGAATACrCAGTTGCCAAATACTGGAACGATATCAGTTCrGCAGCTCCAAACACAGACTACGTGATTTACAACA 
TTCCTCAATTGGCAGGGGTTGCTTTGACTCCAAGCCrrrACACAGAAATGTTGAAAAArCCTCGTGT^ 
GAAGAACTCTTCTATGCCAGTTCAAGATATCCAAACCTTTGTCAGCCTTGGTGGAGAAGACCATATCGTCT^ 
GGTCCTGATGAGCAGTTCCTAGGAGGACGCCTCATGCCGGCTAGGGCTGGTATCGGTGGTACTTATGGTGCTATGC 
CAGAACTCTTCITGAAACTCAATCAGTTGATTGCGGATAAGGACCTAGAAACAGCGCGTGAATTGCAGTATGCTAT 
55 CAACGCAATCATTGGTAAACTCACTTCTGCTCATGGAAATATGTACGGTGTCATCAAAGAAGTCTTGAAAATCAAT 
GAAGGGTTGAATATTGGATCrGTTCGTTCACCATTGACACCAGTGACTGAAGAAGATCGTCCAGTTGTAGAAGCG 
GCTGCTGCCTTGATTCGTGAAACCAAGGAGCGCTTCCTCTAA 

4110.2 

60 ATGTATAAGACAAAGTGTTTACGAGAGAAGTTAGTATTATTTTTAAAAATTTTCTrCCCAATCCT 

TTGCCAATTATTCTGCCTCrrrTGTTGATACTGC^^ATGACAGGTCAATACAACACTATGGACTTGGC^^ 
ATGGCAACCAGTATCTGGAATC Li i I C i Vl ACAnTCTAACAGGG ATTGTGTCAGCCTTGGTGCCTATCATTGGTCA 
CCATCrrGGTCGAGGCAAAAAGGAAGAAGTTGCGTCTGATTTITACCAATTTATTrATT^ 

GTGGTCTTGCTGGGGATGGTA Cl i H Ci 1 GGCACCAATAATCTTGAATCATATTGGGTTAGAAGCAGCAGTAGCGG 
65 CAGTAGCGGTTCGCTATCTTTG OI 1 1 r l ATCTATCGGGATTATCCCCTrGTTGCTCTTTAGCGTCATTCGTTCCTTGC 



35 



TGGATTCGCTGCGCTTGACCAAACTGTCCATGTACCTCATGCTTrrGTTACT^ 

CTCXTGArrTACGGTGCCTTTGGTGTTCCAGAACTGGGAGGGGCTGGTGCTGGTrTAGGAACATCOT 
GGGTCirGCTrGGGATTrCTGTTCTGGTmATTTAAACAGGAGAAGCTCAAAGCC^ 

TCCACnTAATATGGATAAAATTAAGGAAGGAGTrCGTTTAGGTCTGCCTATTGGGGGAACTGTCrrCGC^ 
5 GCTATCrnTCAGTGGTrGGCTTGATrATGGCTAAGTTrrCGCCCTrGATTATAGCT 

CrrTTCAAGTCTTATGTACGCCTTTCCTATGAGTATCTCATCGGCTATGGCTArrGTCC^^ 
CCAAGCGATTTGATGATGCGAAAACCTATATTGGTCTAGGAAGATGGACTGCCCTCAri 4 1 1 GCGGCCTTCACCrr 
AACCTrCCmACATTTTrAGGGGAAATGTGGCCAGTCTTTATGGTAACGACCC^^ 
CGTTTmAACTTATAGTCrrTTCrrCCAGTTAGCAGATACCTTrGCGGCGCC 
10 TAAGGATACAGTTATTCCTTTTrACCTTGGTrrGCTTGGTTATTGGGGCGT^^ 

ATGAGTACTTTAGCAAAAATAGAAGCGCTCTTGrr^ 

GAACTCCTCTCTCTGCCACCGACAGGCATCCAGCAAAGTTTAGGAAAATTAGCCCAGAAGTATGAAAAGG^ 

15 GATTCCAGTrTGGCTTrGATrGAGACAAGTGGTGCTTATAGATTGGTGACCAAGCCTCAATT^ 

AGGAATACTCTAAGGCGCCTATCAACCAGAGCTTGTCTCGGGCTGCCCTTGAGACCTTGTCCArrATTGCCT^ 
ACAGCCGATTACGCGGATAGAAATTGATGCCATCCGTGGAGTTAACTCGAGTGGAGCCTTGGCAAAGTTGCAGGC 
rnTGACCTGATAAAGGAAGACGGGAAAAAGGAAGTATTGGGGCGCCCCAACCTCTATGTGACTACGGATTATTT 
CCTAGATTACATGGGGATAAACCATITAGAAGAATTACCAGTGATOATGAGCTTGAGATTCAAGCCCAA^ 

20 CCAATTATTTGGTGAAAGGATAGAAGAAGATGAGAATCAATAA 

ATCGATACGATGATTAGTAGATTTTTrCGCCATTTAT^ 

CAGTAGCTGCTGTCAGTTCAGTCATGATTACmGACCTTGGTGGCAATATTTGCATCTGTTA 
25 AAACTAGCTACAGATATTGAAAATAATGTCCGTGTAGTAGTTTATATCCGAAAGGATG TGGAA GATAATAGTCAG 

ACAATTGAAAAAGAAGCTCAAACTGTTACAAATAATGACTACCACAAGGTATATGATTCTl^ 
ACGGTTAAAAGTGTTACCrrTTCAAGTAAAGAAGAACAATATGAAAAATTAACCGAGATAATGGGAGAT^ 
AAAATCTTTGAAGGAGATGCCAATCCTCTCTATGATGCCTATATTGTAGAGGCAAACACTCCAAATGATGT/^^ 
ACTATAGCCGAAGATGCTAAAAAAATTGAAGGTGTCTCTGAGGTTCAAGATGGCGGTGCCAATA CAGAA AG 

30 TTCAACTTAGCTTCATTTATCCGTGTTTGGGGACTAGGGATTGCTGCTTTGTT^ 

TTCAAATACCATTCGTATTACCATTATrrCCCGCAGTCGCGAAATTCAAATCATGCGCTTGGTCGGAGCT 
AGTrATATCCGTGGACCGTrCTTGTTAGAAGGAGCCTrrATCGGTrrATTGGGAGCTATCGCACCATCT^ 
CTTTATTGTrrATCAAATTGTTTACCAATCTGTCAACAAATCGTTGGTAGGGCAAAATCrATCCATC 
GATrrATrrAGTCCGTTGATGATTGCCCTACTATTTGTGATTGGGGTTTTCATTGGTTCA^^ 

35 CATGCGCCGATTCTTGAAGATTTAG 

ATG/^GAAAGTAAGATTTATTTTITTAGCTCT^ 

GTACTTGGCAAGGAAAACAGTATCTGAAAGAAGATGGCAGTCAAGCAGCAAATGAGTGGGi 1 i i iGATACTCATT 

40 ATCAATCTTGGTTCTATATAAAAGCAGATGCTAACTATGCTGAAAATGAATGGCTAAAGCA^ 

rTACCTCAAATCTGGTGGCTATATGGCCAAATCAGAATGGGTAGAAGACAAGGGAGCCTTITATTATCTTGACCAA 
GATGGAAAGATGAAAAGAAATGCTTGGGTAGGAACTTCCTATGTTCGTGCAACAGGTGCCAAAGTAATAGAAGAC 
TGGGTCTATGATTCTCAATACGATGCTTGGriTTATATCAAAGCAGATGGACAGCACGCAGAGAAAGAATGGCTC 
CAAATTAAAGGGAAGGACTATTATTTCAAATCCGGTGGTTATCTACTGACAAGTCAGTGGAT TAAT CAAGCT^ 

45 TGAATGCTAGTGGTGCCAAAGTACAGCAAGGTTGGCTTTTTGACAAACAATACCAATCnTGGrm 

AAATGGAAAGTATGCTGATAAAGAATGGATTTTCGAGAATGGTCACTATTATTATCTAAAATCCGGTGGCT 
GCAGCCAATGAATGGATTTGGGATAAGGAATCTTGGTTTTATCTCAAATTTGATGGGAAAATGG 
TGGGTCTACGATTCrCATAGTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGAm 
GGGATAAGGAATCTTGGTTTTATCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGAT^ 

50 GTCAAGCTTGGTACTACTTCAAATCCGGTGGTTACATGACAGCCAATGAATGGATTTGGGATAAGGAATCTTG 
TTACCTCAAATCTGATGGGAAAATAGCTGAAAAAGAATGGGTCTACGATTCTCATAGTCAAGCTTGGTACTA 
AAATCTGCTGGCTACATGGCGAAAAATGAGACAGTAGATGGTTATCAGCITGGAAGCGATGGTAAATGGCTTGGA 
GGAAAAACTACAAATGAAAATGCTGCTTACTATCAAGTAGTGCCTGTTACAGCCAATGrrTATGATTCAG 
AAAAGCTTTCCTATATATCGCAAGGTAGTGTCGTATGGCTAGATAAGGATAGAAAAAGTGATGACAAGC GCTTG 

55 CTATTACTATTTCTGGTTTGTCAGGCTATATGAAAACAGAAGATrTACAAGCGCTAGATGCTAGTAAGGAC^^ 

cccttattatgagagtgatggccaccg n 1 i i atcactatgtggctcagaatgctagta tccca gtagctrctcat 
crrtctgatatggaagtaggcaagaaatattattcggcagatggcctgcattttgatggttt^ 
ccrrrccttttcaaagatttaac^igaggcracaaactacagtgctgaagaattggat^ 
cattaacaatagccttttggagaacaagggcgctacttttaaggaagccgaagaacattac^ 
60 tatctccttgcccatagtgccctagaaagtaactggggaagaagtaaaattgccaaagataagaa taar i ici i io 
gcattacagcctatgatacgaccccttacctttctgctaaga catt tgatgatgtggataagggaattt^ 

AACCAAGTGGATTAAGGAAAATTATATCGATAGGGGAAGAACTTTCCTTGGAAACAAGGCTTCTGGTATGAATGT 

GGAATATGCTTCAGACCCTTATrGGGGCGAAAAAATTGCTAGTGTGATGATGAAAATCAATGAGAAGCTAGCT 

CAAAGATTAG 

65 
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ATGAAAAAAGTATTACAAAAATATTGGGCATGGGCrrrTGTGCrrCATCCCCCTCTTCTr 
ATGTGCCGATGTTTCAAGGAGCCTTrrACAGTTTTACCAACTGGACAGGATTGA CTTA TAACT^ 
CTTAAACAACTTrAAGCTCCTCrrCATGGATCCAAAATTCATGAATGCGATTG^ 
5 GCCATGGTGGTTGGTGAGATTGCACTCGGGATCTTCATTGCGCGTGTCTTGAi^ 

TCTTCCGTGCrrG O 'r i ' C I ICU CCCAGCTGTTTTATCTGGT rrGA CAGTGGCriTGAT 

TACGGTCTTCCAGCGATTGGAAATGCCCrrCATATTGAATTTTTCCAAACCAGTCT^ 

CAATCTTTGCGGCTGTCrrTGTCCTTCTTTGGCAAGGGGTGGCTATGCCCATCATC^^ 

TCTATTCCAACTGAGArrACAGAGGCAGCAAGGATTGATGGTGCGACTAGCAAGCAAGrmCT GGAA CATTGAA 
10 TTGCCTTACTrGCTACCAAGTGTCTCTATGGTCTTTATCCTAGCCCTAAAAGGTGGGCT^ 

CTTrGCCATGACCGGTGGTGGTCCAAACAATGCCACAACCTCACTTGGGCTCrrGGTTTAT^ 
AACAACCAATTCGGTTATGCCAATGCCATTGCCGTAATCnTGTTCTTCTTAArrGTAGTGAT^ 
GAGAGTATCTAAGAAATTTGAAATTTAA 



ATGATGAAACAAGATGAAAGAAAAGCCCTGATTGGCAAATACATTCTATTGATTCTAGG ATCGG T TCTGA TT^ 
TGCCGCTCCTTGCTACCCTCnTrAGTTCCTTCAAACCCACTAAGGATATTGTAGATAAT^ 
AACTTCACATGGGACAACTTTAGCCGTCTCTrAGCTGATGGGATTGGAGGCrrATTATTG^ 
CTGTCrrGTCriTACnTGCAGTAATGATCTTTATCCCTATGGCAGCCTACTCCATCGCTCGC 
20 AAAAGCCTTTACCATCATGTATACCCTCTTAATCCTCGGAATCTTCGTACCTTTCCAAGTCAT^ 
CGGTTATGATGAGTAAACTCGGTTTGGCTAATACCriTGGTTTGATCTTGCTCTACr^ 

ACCCTCTrrCTCTATGTTGGCTATATCAAAATCTCGATTCCAGAAAGTCTGGATGAAGCAGCAGAGATCG^^ 
CTAATCAATTTACAACCTATrrCCGCATCATCTTCCCAATGATGAAACCGATGCATGCGACAACCATGATCATCA^ 
TGCCCTTTGGTTCTGGAATGACrrCATGTTGCCACTCCTTGTCTTGAACCGGG^ 
25 TGTTCCAATACAACTACGCAGGCCAATATTTCAACGACTACGGACCAAGCTTTGCCTCTTACGTGCTCGGCAT^ 
CAGTATCACCATTGTCTATCTCrrCTTCCAACGCCATATCATTTCAGGAATGAGCAACGGGGCAGTGAAGT 

4119.4 

ATGAAAAGTATTCTTCAGAAAATGGGGGAGCATCCGATGCTGLl iLli 1 i iCl i AG CTATAGTACT GTTATATCCA 
30 TTCTTGCACAAj^ATTGGATGGGTCrrGTGGarrCAGTAGGAATGTTTCTATTTACT^ 

TCG ATTTTATCCCATAAATTCTTTCGATTG ATTTTGCAGTTT GTC^ I'l' 1 GCC 

AGTTTAGAACATTTCCAAATTGTGAAGAAArTTAACTATGCTTTTCTTTCACC^ 

ACCGGGCAGAAGTGAC Cn C Vi i AATCCTAATTATTATGGAATTATTTGrrGTTTCTGTATTATG ATTGCTTTCT AT 
CTGTTTACAACGACCAAGTTGAATTGGTTGAAAGTATTCTGTGTGATTGCAGGC mGTTA ATCTC^ 

3 5 CTTTACTCAAAATCG AACTGCCTTTCCTGCTATTATCGCTGG AGC AATTATCTATCT 

GGAAGGCCTTTTGGCTTAGTATTGGGGTCrrCGCGATTGGTTTGAGTTTCCTC^^ 
ATGGGTACTrrAGACTCTTCTATGGAAGAACGCATTrCTATCTGGGATGCTGGGATGGCCTTGTTT 
CTTTTTGGGGTGAAGGGCCATTGACCTATATGAACTCTTATCCTCGGATACATGCTCCTTAT^ 
CAGTCTTTATATTGATACGATTCTGAGrrACGGAATTGTGGGGACTATTrrArrAG^ 

40 TTCGCTTGATGATGGATATGAGTCAGGAGTCGGGGAAACGTCCGATTATCGGCCTTTATCrATCTTTCCTTACAGT 
GGTTGCTGTGCACGGAA ITI ri GACTTGGCTCTCTTCTGGATTCAGTCAGGCTrTA'i'ri'i'Ci 1 GCTAGTTATGTGCA 
GTATTCCATTGGAGCATCGAATGTTGGTATCGGACATGACGGATTAA 

4120.1 

45 ATGTCAAAGATGGATGTTCAGAAAATCATTGCACCGATGATGAAGTTTGTGAATATGCGTGGCATTATAGCTCTAA 
AAGATGGOATGTTAGCAATTrrGCCATTGACAGTACTTGGTAGTTTGTTCTTGATTATGGGACAATTGCCGTTC^ 
AGGATTAAATAAGAGCATTGCTAGTGTTTTTGGAGCTAATTGGACAGAGCCGTTTATGCAAGTAT^ 
TTTGCTATTATGGGTCTAATTTCTTGTTTTTCAATTGCCTATTCTTATGCTAAGAATA 
AGCrGGAGTTCTATCTGTATCrGCATTCTTrATTTTGCTAAGATCATCTTATATCCCTAA^ 

50 GGGACGCTATTAGTAAAGTTTGGrrTGGAGGCCAAGGAATTATCGGTGCTATCArrATAGGTTTGGTAGTAGG 
TATTTATACCrrcriTATAAAGAGAAAAATTGTTATTAAGATGCCAGAACAAGTTCCACAAG 
rrrGAAGCAATGATTCCAGCATTTGTAArrilCTTATCTTCTATGATTGTATATATTTT 
TGGCGGAACATTCATAGAAATGArrrATTCrGCTATTCAAGTTCCGTTGCAAGGTTTAACTGGATC^^ 
GCrATTGGAATTCCATTCTTTATATCAi 1 i I i GTGGTGCTTTGGTGTTCATGGGCAATCGGTAGTAAATCGAGTAGT 

55 G ACAGCTCTGCl 1 1" I ATCTAATCnTGATGCTAATAAAGCTATGTTACCCTCTGCTAATCTATCATTAGAAAATGGT 

GCACATATTGTTACTCAACAA I i i 1 i AGArrCA lUnU - I 'AATTCTATCAGGTTCAGGGATTACGTTTGGTCTTGTAGT 
TGCCATG Cl i 1 1 i GCAGCAAAATCAAAACAATACCAAGCCTTAGGAAAAGTTGCAGCTTTTCCAGCAATATr^ 
GTAAATGAGCCAGTTGTATTTGGATTTCCGATTGTCATGAATCCAGTTATGTTTGTACCrrrCAT^ 
ACTTGCAGCTGTGATAGTATATGGAGCTATTGCAACAGGTTTCATGCAGCCATTCTCAGGGGTAACATTGCCT^ 

60 AGTACACCAGCTATTTTATCAGGAriU-rrGGTGGGTGGATGGCAAGGAGTTATTACTCAGCTGGTGATATTAGCGA 
TGTCTACATTGGTTrATTTrCCATTCTTTAAAGTACAGGATCGTTTAGCTTACCA^ 



4121.2 

ATGAAGAAAAAGGACTTAGTAGACCAACTAGTCTCAGAGATCGAGACGGGGAAAGTCAGGACACTGGGAA TATA 
65 CGGTCATGGAGCrrCAGGTAAATCAACCTrrGCACAGGAATTGTACCAAGCTTTAGATTCTACTACAGTAAAT^ 
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CTAGAGACAGATCCTTATATCACCTCAGGACGCCATCTGGTAGTACCCAAGGACGCGCCGAATCAAAAGGTGACA 
GCCAGTCTGCCAGTGGCGCATGAACTGGAGAGTTTGCAGAGAGATATCCTTgCTTGCAGGCGGGTATGGATGTCTT 

GA 

5 

4122.1 

ATGAAGAAAAGATACCTAGTCrTGACAGCTTTGCTAGCCTrGAGTCTAGCAGCTTGT^^ 

AATGAAGATGGAGAAACTAAGACAGAACAGACAGCCAAAGCTGATGGAACAGTCGGTAGTAAGTCTCAAGGAGC 
TGCCCAGAAGAAAGCAGAAGTGGTCAATAAAGGTGATTACTACAGCATTCAAGGGAAATACGATGAAATCATCCT 

10 AGCCAACAAACACTATCCATTGTCTAAAGACTATAATCCAGGGGAAAATCCAACAGCCAAGGCAGAGrrGGTCA^ 
ACTCATCAAAGCGATGCAAGAGGCAGGTTrCCCTATTAGTGATCATTACAGTGGTTTTAGAA 
ACCAAGCTCTATCAAGATrATGTCAACCAAGATGGAAAGGCAGCAGCTGACCGTTACTCTGCCCGTCCTGGCTA^ 
AGCGAACACCAGACAGGCTTGGCCTTrGATGTGATTGGGACTGATGGTGATTTGGTGACAGAAGAAAAAGCAGCC 
CAATGGCTCTTGGATCL^TGCAGCTGATrATGGCTTTGTrGTCCGTTATCTCAAAGGCAAGGAAAAGGA^ 

15 aTATGGCTGAAGAATGGCACCTGCGTTATGTAGGAAAAGAAGCTAAAGAAj\TTGCTGCAAGTGGTCTCAGTTTGG 
AAGAATACTATGGCTTTGAAGGCGGAGACTACGTCGATTAA 



ATGCGTAAATTCTTAATTATTTTGTTGCTACCAAGTTTTTTGACCAT^ 
20 TCGTCTATACrrCGAAAGAAATTTATTACCTrTCACAATCTGACrrrGGTATTTATm 

CCCATGGTTTATGGAGAGGTTCCTGTTTATGCGAATGAAGATTTAGTAGTGGAATCTGGGAAATTGACTCCC/^^ 
CAAGTTTTCAAATAACCGAGTGGCGCTTAA-^TAAACAAGGAATTCCAGTATTTAAGCTATCAAATC^ 
AGCTGCGGACAAACGArnTTATATGATCAATCAGAGGTAACTCCAACAATAAAAAAAGTATGGTTAGAATCTGA 
CTTTAAACTGTACAATAGTCCTTATGATTTAAAAGAAGTGAAATCATCCTTATCAGCTTATTCGCA^ 
25 GACAAGACCATGTTTGTAGAAGGAAGAGAATtTCTACATATTGATCAGGCTGGATGGGTAGCTA AAGA ATCAACT 
TCTGAAGAAGATAATCGGATGAGTAAAGTTCAAGAAATGTTATCTGAAAAATATCAGAAAGATTCrrr CT 
ATGTTAAGCAACTGACTACTGGAAAAGAAGCTGGTATCAATCAAGATGAAAAGATGTATGCAGCCAGCUrri'lGA 
AACTCTCTTATCTCTATTATACGCAAGAAAAAATAAATGAGGGTCTTTATCAGTTAGATACGA 
ATCTGCAGTCAATGATTTTCCAGGTTCTrATAAACCAGAGGGAAGTGGTAGTCTrCCTAAAAj^ 
30 AGAATATTCTTTAAAGGATTTAATTACGAAAGTATCAAAAGAATCTGATAATGTAGCTCATAATCTArrG 

TACATTTCAAACCAATCTGATGCCACATTCAAATCCAAGATGTCTGCCATTATGGGAGATGATTGGGATCCAAAAG 
AAAAATTGATTTCITCTAAGATGGCCGGGAAGTTTATGGAAG CTATT TATAATCAA 

TTTGACTAAAACAGATTTTGATAGTCAGCGAATTGCCAAAGGTGTrrCTGTTAAAGTAG 

CTCATAAAATTGGAGATGCGGATGAATrrAAGCATGATACGGGTGTTGTCTATGCAGATrCTCCATTTATTCTTrCT 
3 5 ATTTTC ACT AAG AATTCTG ATT ATGAT ACG ATTTCTAAG ATAGCC AAGG ATGTTTATG AGGTTCT A^ 

4125.7 

ATGAAAAAACAAAATAATGGTTTAATTAAAAATC Crrn CTATGGTTATTAriTAT Cl i 11 1 CCTTGTG ACAGGATT 
CCAGTArrrCTATTCTGGGAATAACTCAGGAGGAAGTCAGCAAATCAACTATACTGAGTTGGTACAAGAAATTAC 
40 CGATGGTAATGTAAAAGAATTAACTTACCAACCAAATGGTAGTGTTATCGAAGTTTCTGGTGTCTATAAAAATCCT 
AAAACAAGTAAAGAAGAAACAGGTATTCAGTTTTTCACGCCATCTGTTACTAAGGTAGAGAAATTTACCAGCACT 
ATTCTTCCTGCAGATACTACCGTATCAGAATTGCAAAAACTTGCTACTGACCATAAAGCAGAAGTAACTGTT^^ 

atgaaagttcaagtggtatatggattaatctactcgtatccarrgtgccatttggaattctattcttcttcct^ 
tctatgatgggaaatatgggaggaggcaatggccgtaatccaatgagttttggacgtagtaaggctaaagcagca 

45 aataaagaagatattaaagtaagatitrcagatgttgctggagctgaggaagaaaaacaaga actag ttga^ 

gttgagttcrraaaagatccaaaacgattcacaaaacttggagcccgtattccagcaggtgttcttttggagggac 
ctccggggacaggtaaaacrrrgcrrgctaaggcagtcgctggagaagcaggtgttccattctttagtatctcagg 
ttctgacnttgtagaaatgttrgtcggagtrggagctagtcgtgttcgctcrcttrttg^^ 
gcaccagctatcatctttatcgatgaaattgatgctgttggacgtcaacgtggagtcggtctcggcggaggtaatg 

50 acgaacgtgaacaaaccttgaaccaacj i 1 igattgagatggatggttttgagcgaaatgaagggattatcgtca 
tcgctgcgacaaaccgttcagatgtacrrgaccctgccljl i 1 1 gcgtccagg acg 1 i \ vgatagaaaagtattcgt 
tggtcgtcctgatgttaaaggtcgtgaagcaatcttgaaagttcacgctaagaataagc 

CTTTAGCAGAAGATGTTGATTTGAAATTACTGGCTCAACAAACTCCAGGCTTTGTTGG^^ 

CrrGAATGAAGCAGCTrTAGTrGCTGCTCGTCGCAATAAATCGATAATTGATGCTTCAGATATTGATGAAGCAGAA 
55 GATAGAGrrATTGCTGGACCTTCTAAGAAAGATAAGACAGTTTCACAAAAAGAACGAGAATTGGTTGCTTACCAT 
GAGGCAGGACATACCATTGTTGGTCTAGTCTTGTCGAATGCTCGCGTTGTCCATAAGGTTACAATTGTACCACGCG 
GCCGTGCAGGCGGATACATGATTGCACTTCCTAAAGAGGATCAAATGCTTCTATCTAAAGAAGATATGAAAGAGC 
AATrGGCTGGCTTAATGGGTGGACGTGTAGCTGAAGAAATTATCrrTAATGTCCAAACCACAGGAGCTTCAAACG 
ACrrTGAACAAGCGACACAAATGGCACGTGCAATGGTTACAGAGTACGGTATGAGTGAAAAACTTGGCCCAGTAC 
60' AATATGAAGGAAACCATGCTATGCTTGGTGCACAGAGTCCTCAAAAATCAATTTCAGAACAAACAGCTTATGAAA 
TTGATGAAGAGCTTCGTTCATTATTAAATGAGGCACGAAATAAAGCTGCTGAAATTATTCAGTCAAATCGTGAAAC 
TCACAAGTrAATTGCAGAAGCATTATTGAAATACGAAACATTGGATAGTACACAAATTAAAGCTCTTTACGAAAC 
AGGAAAGATGCCTGAAGCAGTAGAAGAGGAATCTCATGCACTATCCTATGATGAAGTAAAGTCAAAAATGAATGA 
CGAAAAATAA 

65 
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ATGAGGGAACCAGATTTTTTAAATCATTITCTCAAGAAGGGATATT^ 

TirCTGGTGGATTAGATrCCATGrrrCTATTtAAGGTATrGTCT-ACTrATCAAAAAGAG^^ 
CrAGCTCATGTGAATCATAAGCAGAGAATTGAATCAGATTGGGAAGAAAAGGAATTAAGGAAG^^ 
5 GCAGAGCTTCCTATrrATATCAGCAATTrrrCAGGAGAATTrrCAGAAGCGCGT 

rrrrrCAAGAGGTCATGAAAAAGACAGGTGCGACAGCrrrAGTCACTGCCCACCATGCT 

CGATITITATGCGCTTGATrCGAGGAACTCGCTTGCGCTATCTATCAGGAArrAAG^ 

GATAGAAATCArrCGTCCCrrCTTGCATTTTCAGAAAAAAGACm 

AATCAGGAGAATCATTATrrrCGAAATCGTATTCGAAATTCTTACTTACCAGAATTGGA/^ 
1 0 TTAGGG ATGCAATCTT AGGCATrGGCAATG AAATTTT AG ATTATG ATITGGCAAT^ 

TAATGTGGAAGATTrACAGCAGTTATTrrCrrACTCTGAGTCTACACAAAGAGTTr^ 
GTTTTCCAGATTTGAATCTTACAAAAGCTCAGTTrGCTGAAGrrCAGCAGATTTTV^^ 
TCATCCGATTAAAAATGGCTATGAATTGATAAAAGAGTACCAACAGTTTCAGATTTGTAA^ 
GATGAAAAGGAAGATGAACTTGTGTTACACTATCAAAATCAGCTAGCTTATCAAGGATATTrAi 1 i lu 1 1 iGGAC 
15 rrCCATTAGAAGGTGAATTAATTCAACAAATACCTGTTrCACGTGAAACATCCATACACATT^^ 
AGGAGATGrnTGArrAAAAATGGGCATAGAAAAAAACTCAGACGTTrATTTATTG 
ATTTGAAAATCCCTATGGAAAAGAGAAACTCTGCTCTTATTATrGAGCAArTTGGTG^ 
AATTGCGACCAATAATTTGAGTAAAAAAACGAAAAATGATATAATGAACACTGTACTTTATATAGAA^ 



20 



TAGGTAA 



ATGAAGCGTTCTTCTCTTTrAGTTAGAATGGTTATTTCCATCT^ 

ACrrrCTACTATCAATCAACTrCTrCAGCCATTGAGGCCACCATTGAGGGCAACAGCCAAACGACCATCAGCCAG 
ACTAGCCACTTTArrCAGTCTTATATCAAAAAACTAGAAACCACCrCGACTGGTT^ 

25 TGGCCTATGCTGAGAATCCCAGTCAAGACAAGGTCGAGGGAATCCGAGATTTGTTTTTGACCATCTTG^ 
TAAGGACrTGAAAACTGTTGTGCTGGTGACCAAATCTGGTCAGGTCATrrCrACAGATGACAG^ 
AACTTCCTCTGATATGATGGCTGAGGATTGGTACCAAAAGGCCATTCATCAGGGAGCTATGCCTUi 1 i IGACTCCA 
GCTCGTAAATCAGATAGTCAGTGGGTCATrrCTGTCACrCAAGAACTTGrrGATGCAAAGGGAG^ 
TGCrrCGTITGGATATTTCTTATGAAACTCTGGAAGCCTATCTCAATCAACT 

30 cirCATTATCAATGAAAACCATGAATTTGTCTACCATCCTCAACACACAGTTTATAGTrCGTCT^ 

GCTATGAAACCCTACATCGATACAGGTCAGGGTrATACTCCTGGTCACAAATCCTACGTCAGTCAAGAGAAGATT 
GCAGGAACTGATTGGACGGTGCTTGGCGTGTCATCATTGGAAAAGTTAGACCAGGTTCGGAGTCAGCTCTTGTGG 
ACCTTGCTTGGGGCCAGTGTCACATCTCTTCTTGTCTGTCTCTGCnTAGTGTGG^ 
TCCTTrGAAGGATrrGAGAGAAACCATCTTGGAAATTGCTTCTGGTGCTCAAAATCTTCGTG 

35 GCCTATGAACTGAGAGAAGTAACTCGCCAATTTAATGCTATGTTGGATa\GATrGATCAGTrGATGGTA^ 

GTAGCCAGGAAGAAACGACCCGTCAGTACCAACrrCAAGCCCTTrCGAGCCAGArTAATCCACATrrCCTCT^ 

CACITTGGACACCATCATCTGGATGGCTGAATTrCATCATAGTCAGCGAGTGGTGCAGGTGACCAACTCCTTG^^ 

ACCTATTTCCGCrrGGCGCTCAATCAAGGCAAGGACTTGATTTGTCTCTCTGACGAAATCA^^ 

atctctttatccagaaacaacgctatggagataagctggaatacgaaattaatgaaaatgttgcctt^ 
40 agtcttacccaagctggtcctacaaccccttgtagaaaatgctctttaccatggcattaaggaaaagg 

GGGCCATATTAAACTTTCTGTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGCGrrGGCT^ 
CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGTTGGTCTTCAAAATGTCGATCAACGGCTC 

aaacttcattttggagccaattaccatatgaagattgattctagaccccaaaaagggacgaaagt^ 
ataaatagaatagaaactagctaa 

45 

4126.7 - 

atgaagcgttcttctcttttagttagaatggttatttccatcmctggtcm 

actttctactatcaatcaagttcttcagccattgaggccaccattgagggcaacagccaaacgaccatcagccag 

50 actagccactttattcagtcttatatcaaaaaactagaaaccacctcgactgg tttgac ccagcagacgga^ 
tggcctatgctgagaatcccagtcaagacaaggtcgagggaatccgagatttgtttttgaccatct^ 
taagcacttgaaaactgttgtgctggtgaccaaatctggtcaggtcatitctacagatgacagtg tgca gatgaa 
aacntcctctgatatgatggctgaggattggtaccaaaaggccattcatcagcgagcratgcctgttttg 
gctcgtaaatcagatagtcagtgggtcatttctgtcactcaagaacrrgttgatgcaaagggagccaat c^ 

55 tgcrrcgtttggatatttcrratgaaactctggaagcctatctcaatcaacrccagttggggcag^^ 

cttcattatcaatgaaaaccatgaatrrgtcraccatcctcaacacacagtttatagttcgtctagcaaaatg 
gctatgaaaccctacatcgatacaggtcagggttatactcctggtcacaaatcctacgtcagtcaagagaagatt 
gcaggaactgattggacggtgcttggcgtgtcatcattggaaaagttagaccaggttcggagtcagctcttgtgg 
accttgcrrggggccagtgtcacatctcttcrrgtctgtctctgcttagtgtggt^^ 

60 tcctttgaaggatttgagagaaaccatgttggaaattgcrrctggtgctcaaaatcttcgtc 

gcctatgaactgagagaagtaactcgccaattraatgctatgttggatcagattgatcagttgatggtagctat^^ 
gtagccaggaagaaacgacccgtcagtaccaacttcaagcccttrcgagccagattaatccacatttcctctati^ 
cacrrtggacaccatcatctggatggctgaatttcatgatagtcagcgagtggtgcaggtgaccaagtccttggca 
acctatttccgcttggcgctcaatcaaggcaaggacttgatttgtctctctgacgaaatcaatcat^ 

65 atctctttatccagaaacaacgctatggagataagctggaatacgaaattaatgaaaatgttgcctttgataa 
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AGTCTTACCCAAGCTGGTCCTAC>^CCCCnTCrrAGAAAATGCTCTrrACCATGGC^ 
GGGCCATArrAAACTTrCr.GTCCAGAAACAGGATTCGGGATTGGTCATCCGTATTGAGGATGATGGC^ 
CAAGATGCTGGTGATAGTAGTCAAAGTCAACTCAAACGTGGGGGAGrrGGTCTTCAAAATCTCGATCAACGGCTC 
AAACTTCATTrrGGAGCCAATTACCATATGAAGATTGATTCTAGACCCCAAAAAGGGACGAAAGTTGAAATATAT 

5 ATAAATAGAATAGAAACTAGCTAA 



ATGTTTrrTAAATTATTAAGAGAAGCTCTTAAAGTCAAGCAGGTTCGATCAAAAAT^ 
TTTGGTCTrrCGTATCGGAACTAGCATTACAGTTCCTGGTGTGAATGCCAAT^^ 
1 0 CCTTCTT AAAC ATGTTG AGCTTGGTGTCGGGG AATGCCCT AAAAAACITTTCG AT^^ 

CTATATCACCGCTTCTATrGTrGTCCAACTCTTGCAAATGGATATrrTACCCAAGTrrGT AGAG TGGGC^ 

GGGGAAGTAGGTCGAAGAAAATTGAATCAAGCTACTCCrrATATTGCTCTAGTTCTCGCTm 

GGATTACAGCTGGTrrrAATACCTTGGCTGGAGCTCAArrGATTAAAACTGCTTTAACT 

GArrGGTATCATCTTAACAGCTGGTAGTATGATTGTCACrTGGTTGGGTGAGCAAArrACAGATAAGGGATACGGA 
15 AACGGrGrn-CCATGATTATCTTTGCCGGGATTGrrrCCTCAATTCCAGAGATGATTCAGGGCATCTATOT^ 
ACrrTGTGAACGTCCCAAGTAGCCGTATCACTTCATCTATCATTTrCGTAATCATm 
ATrATrrACTTTACAACTTATGrrCAACAAGCAGAATACAAAATTCCAATCCAATATACT^ 
GTCCATCTAGCTCTTACCTTCCGTTAAAAGTAAACCCTGCTGGAGTTATCCCTGTTATCrr^ 
GCAGCGCCTGCGGCTATTCTTCAGTTTTTGAGTGCCACAGGTpVTGA 
20 TGTTGGCAACTACTTCTCCAACTGGTATTGCCATGTATGCrrrGTTGATTATTCTCT^ 

TACAGATTAATCCTGAAAAAGCAGCAGAGACCTACAAAAGAGTGGTGCCTATATCCATGGAGTTCGTCCTGGTAA 
AGGTACAGAAGAATATATGTCTAAACTTCTTCGTCGTCTTGCAACTGTTGGTTCCCTCTTCCT^ 



ATGGATATTAGACAAGTrACTGAAACCATCGCCATGATTGAGGAGCAAAACTTCGATATTAGAACCATTACCATG 
GGGATTTCTCTTTTGGACTGTATCGATCCAGATATCAATCGTGCTGCGGAGAAAATCTATCAAAAAATr 
AGGCGGCTAATTTAGTAGCTGTTGGTGATGAAATTGCGGCTGAGTTGGGAATTCCTATCGTTAATAAGCGTGTA^ 
GGTGACACCTATTTCTCTGATTGGGGCAGCGACAGATGCGACGGACTACGTGGTTCTGGCAAAAGCGCITGATAA 

30 GGCTGCGAAAGAGATTGGTGTGGACTTTATrGGTGGTTTTTCTGCCTTAGTACAAAAAGGTTATCAA^ 
GAGATTCTCATCAATTCCATTCCTCGCGCTTTGGCTGAGACGGATAAGGTCTGCTCGTCAGTCAAT AT^ 
CCAAGTCTGGTATTAATATGACGGCrGTGGCAGATATGGGACGAATTATCAAGGAAACAGCAA ATCTT TCAGATA 
TGGGAGTGGCCAAGTTGGTTGTATTCGCtAATGCTGTTGAGGACAATCCATTTATGGCG GGTG CCTTTC 
TGGGGAAGCAGATGTTATCATCAATGTCGGAGTTTCTGGTCCTGGTGTTGTGAAACGTGCTTTGGAAAAAGT^ 

35 GGACAGAGCTTTGATGTAGTAGCCGAAACAGTTAAGAAAACTGCCTTTAAAATCACTCGTATCGGTCAAT^ 

GTCAAATGGCCAGTGAGAGACTGGGTGTGGAGTTTGGTATTGTGGACrrGAGTTrGGCACCAACCCCTGCGGT^ 
AGACTCTGTGGCACGTGTCCrrGAGGAAATGGGGCTAGAAACAGTTGGCACGCATGGAACGACGGCTG CCTTG GC 
CCTCTTGAACGACCAAGTTAAAAAGGGTGGAGTGATGGCCTGCAACCAAGTCGGTGGTTTATCTGGTGCa^ 
CCTGTTTCTGAGGATGAAGGAATGATTGCTGCAGTGCAAAATGGCrCTCTTAATTTAGA/^^ 

40 CGGCrATCTGTTCTGTTGGATTGGATATGATTGCCATCCCAGAAGATACGCCTGCTGAAACTATTGCGGCTATGAT 
TGCGGATGAAGCAGCAATCGGTGTTATCAACATCAAAACAACAGCTGTTCGTATCATTCCCAAAGGAAAAGAAGG 
CGATATGATTGAGTTTGGTGGTCTATTAGGAACTGCAC CCGT TATGAAGGTTAATGGGGCTTCGTCTGTCGACTTC 
ATCTCrrCGCGGTGGACAAATCCCAGCACCAATTCATAGTTTTAAAAATTAA 

45 4X28.1 

ATGACACAGATTATTGATGGGAAAGCTTTAGCGGCCAAATTGCAGGGGCAGTTGGCT^ 

AAGGAAGAAACAGGTCTAGTGCCTGGTTTGGTAGTGATTTTGGTTGGGGACAATCCAGCCAGCCAAGTCTACGTT 
CGCAACAAGGAGAGGTCAGCCCTTGCGGCTGGTTTCCGTAGCGAAGTAGTACGGGTTCCAGAGACCATTACTCAA 
GAGGAATTGTTAGACCTGATTGCTAAATACAATCAGGATCCAGCTTGGCATGGGATTTTGGTTCAGTTGCCAT^ 

50 CAAAACACATTGATGAAGAGGCGGTTCTATTGGCTATTGACCCAGAAAAGGATGTGGATGGTTTCCATCCTCTAA 
ACATGGGGCGTCTTTGGTCTGGTCATCCAGTCATGATTCCTTCGACACCGGCAGGAATTATGGAAATGTTCCATGA 
ATATGGGATTGACTTGGAAGGTAAAAATGCAGTCGTCATCGGTCGATCCAATATTGTCGGAAAACCTATGGCCCA 
G CI 1 C n U GGCAAAG AATGCAACAGT AACCTTGACTCACTCACGT ACTC ATAATCTTTCC AAGGTGGCTGC AAAA 
GCAGATATTCTGGTTGTTGCAATCGGTCGTGCCAAGTTTGTGACTGCTGACTTTGTCAAACCAGGTGCGGTAGTCA 

55 TTGACGTTGGGATGAACCGCGATGAAAATGGTAAGCTCTGTGGGGATGTTGATTATGAGGCGGTTGCCCCACTTGC 
TAGCCACATTACGCCAGTCCCTGGAGGTGTCGCTCCTATGACCATTACTATGCTGATGGAGCAAACCTATCAGGCA 
GCACTTAGGACATTCGATAGAAAATAA 

4128.2 

60 ATGTCTAAATTTAATCGTArrCATTTGGTGGTACTGGATTCTGTAGGAATCGGTGCAGCACCAGATGCTAATAA 
TTGTCAATGCAGGGGTTCCAGATGGAGCTTCTGACACACTGGGACACATTrCAAAAACAGTTGGTTTGAATGT^ 
AAACATGGCTAAAATAGGTCTTGGAAATATTCCTCGTGAAACTCCTCTTAAGACTGTAGCAGCTGAAAGC/^ 
AACTGGATATGCAACAAAATTAGAGGAAGTATCTCTTGGTAAGGATACTATGACTGGACACTGGGAAATCATGGG 
ACTCAACATTACTGAGCCTTTCGATACTTTCTGGAACGGATTCCCAGAAGAAATCCTGACAAAAATCGAAGAAT^ 

65 TCAGGACGCAAGGTTATTCGTGAAGCCAACAAACCTTATTCAGGAACGGCTGTTATCTATGATTTTGGACCACG^ 
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AGATGGAAACTGGAGAGTTGATTATCTATACTTCAGCTGACCCTGTTTTGCAGATTGCTOCCC^ 
TCCmGGATGAATrCTACCGTATCTGTG.AATACGCTCGrrCGATTACCCTTGAGCGTCCTGCC 
TCATTGCTCGCCCTTATGTAGGTGAACCAGGTAACTTCACTCGTACGGCAAACCGTCGTGACT^ 
ATTTTrCCCAACTGTTTTGGATAAATTGAATGAGGCTGGTATCGATACTTATGCTGTGGGTAAAATCA^ 
5 TTTAACGGTGCTGGTATCAACCATGACATGGGTCACAACAAGTCAAATAGTC^TGGAATTGA TACAC T^^ 
ACTATGGGACTTGCTGAGTTTGAAAAAGGATTCTCATTCACAAACCTAGTTGACTTTGATGCCC^^ 
GTCGTAATGCTCACGGTTACCGTGATTGCTTGCATGAGTTTGATGAACGCTTACCTGAAATTATCGCAGCTA^^ 
AGAGAATGACCTTCTCTTGATTACTGCGGACCATGGAAATGACCCAACGTATGCAGGAACGG ATCA CACTCGGGA 
ATATATTCCATTGTTGGCCTATAGCCCTGCCTTTAAAGGAAATGGTCTCATTCCAGTAGGACATm 
10 CAGCGACrCTTGCCGATAACTTTGGTGTGGAAACTGCTATGATTGGGGAAAGTTTCT^ 

4129.2 

ATGrrTATTTCCATCAGTGCTGGAATTCTGACATTTrTACTAACTTTAGTAGA 

TAGAAAGGCGCAAATTACAGGCCAGCAGATGCATGAGGATGTCAAACAGCATCAGGCAAAAGCTGGGACTCCTA 
1 5 CAATGGGAGGTITGG in ' lUCri 'GATTACTTCTGTTTTGGTTG CrriClU ' rrr CGCCC^^ 

ATAATGTGGGAATGATTTTGTTCATCrrGGTCTTGTATGGCTTGGTCGGATTTITAGATGA^^ 
CGTAAAATCAATGAGGGGCTTAATCCTAAGCAAAAATTAGCTCTTCAGCTTCTAGGTGGAGTTA TOT 
TCTATG AGCGCGGTGGCG ATATCCTGTCTGTCTTTGGTT ATCCAGTTCATTTGGG A i ' 1 1 i 1 GTATATTTTCTrCGCT 
CTTTTCTGGCTAGTCG Cji 1 1 1 i CAAACGCAGTAAACTTGACAGACGGTGTTGACGGTTTAGCTAGTATTTCCGTTGT 
20 GATTAGTTTGTCTGCCrATGGAGTTATTGCCTATGTGCAAGGTCAGATGGATATTCnTCTAGTGATT< ^ 
TTGGTGGTTTGCTCGGTTTCTTCATCrrrAACCATAAGCCTGCCAAGGTCTTTAT^ 

CTAGGTGGGATGCTGGCAGCTATCTCTATGGCTCTCCACCAAGAATGGACTCTCTTGATTATC GGAA TTGTGTATG 
1 r ri ' 1 G AAACAACTTCrGTTATG ATGCAAGTCAGTTATTTCAAACTG ACAGGTGGTAAACGTATTTTC^ 
GCCTGTACATCACCATTrrGAGCTTGGGGGATTGTCTGGTAAAGGAAATCCTTGGAGCGAGTGGAACGT^ 
25 TTCTTrrpGGGAGTGGGACTTCTAGCAAGTCTCCTGACCCTAGCAAl I i'l ATATTTGATGTAA 

4133.1 

TTGTTTAAGAAAAATAAAGACATTCTTAATATTGCATTGCCAGCTATGGGTGAAAACTT^^ 
GAATGGTGGACAGTTATTTGGTTGCTCATTTAGGATTGATAGCTATTTCAGGGGTTTCAGTAGCTGGT^ 

30 CACCATrTATCAGGCGATTTTCATCGCTCrrGGGAGCTGCTATTTCCAGTGTTATTTC^^ 

GACCAGTCGAAGTTGGCCTATCATGTGACTGAGGCGTTGAAGATTACCTTACTATTAAGTTTCCT^ 
GTCCATCTTCGCTGGGAj^AGAGATGATAGGACriTrGGGGACGGAGAGGGATGTAGCTGAGAGTGGTGGACTGTA 
TCTATCTTTGGTAGGCGGATCGATrGTTCTCTTAGGTTTAATGACTAGTCTAGGAGCCTTGATTCGTGCA^ 
AATCCACGTCTGCCTCTCTATGTTA G ' rri ' 1 ' l ' l 'ATCCAATGCCTTGAATATT Cl 1 1 U I CAAGTCTAGCTAri'i'lTGTT 

35 CTGGATATGGGGATAGCTGGTGTrGCTTGGGGGACAATTGTGTCTCGTTTGGTTGGTCTTGm 

AATTAAAACTGCCTTATGGGAAGCCAACl 1 i iGGTTTAGATAAGGAACTGTTGACCTTGGCTTTACCAGCAGCTGG 
AGAGCGAGTrATGATGAGGGCTGGAGATCTAGTGATCATTGCCTTGGTCGTTrCTTTTGGGACGGAGGCAGTTGCT 
GGGAATGCAATCGGAGAAGTCTTGACCCAGrTTAACTATATGCCTGCCTTTGGCGTCGCTA CGGCAA C GGTCA TG^ 
TGTTGGCCCGAGCAGTTGGAGAGGATGATTGGAAAAGAGTTGCTAGTTTGAGTAAACAAACCrr^ 

40 GTTCCTCATGTTCCCCCTGTCCrrrAGTATATATGTCTTGGGTGTACCATrAACTCATCTCTATACGA 

TAGCGGTGG AGGCT AGTGTTCTAGTG ACACTG i ' 1 ' 1 ' 1 CACTACTTGGG ACCCCTATG ACG ACAGG AACAGTCATCTA 
TACCGCAGTCTGGCAGGGATTAGGAAATGCACGCCTCCCTTTTTATGCGACAAGTATAGGAATGTGGTGTATCCGC 
ATTGGGACAGGATATCTGATGGGGATTGTGCTTGGTTGGGGCTTGCCTGGTATTTGGGCAGGGTCTCTCTTGGATA 
ATGGTTTTCGCTGGTTATTTCTACGCTATCGTTACCAGCGCTATATGAGCTTGAAAGGATAG 

45 

4135.2 

ATGCAAACTCAAGAAAAACACTCGCAAGCAGCCGi ICl I GGCTTGCAGCACTTACTAGCCATGTACTCAGGATCT 

ATCCTGGTTCCCATCATGATTGCGACAGCCCTTGGCTATTCAGCTGAGCAGTTGACCrrACCTGATTTCTACAG 

TCTTCATGTGTGGGGTGGCAACCTTCCTCCAACTCCAACTCAACAAATACTTTGGGATTGGACT^ 

50 TGGAGTTGCATTCCAGTCGGTCGCTCCCTTGATTATGATTGGGCAAAGCCATGGTAGTGGCGCTATGTTTGGTGCC 
CTTATCGCATCTGGGATTTACGTGGTTCnO L i TCAGGCATCTTCTCAAAAGTAGCCAATCTCTTCCCATCTATCGT 
AACAGGATCTGTTATTACCACGATTGGTTrAACCTTGATCCCTGTCGCTATTGGAAATATGGGAAATAACGTTCC^ 
GAGCCAACTGGTCAAAGTCTCITGCTTGCAGCTATTACTGTTCTGATTATCCTCTTGATC^ 
GATTTATCAAGTCTATCTCTATTTTGATrGGTCTGGTTGrrGGAACTGCCATTGCTGCTACTATG^^ 

55 TTCTCTCCTGTTGCGGTAGCTCCACTTGTCCATGTCCCAACTCCACTCTACTTTGGGATGCCAACC^ 

ATCTATTGTCATGATGTGTATCATCGCAACGGTGTCTATGGTTGAGTCAACTGGTGTTTATCTGGCCTTGT 
TCACAAAGGATCCAATCGACAGCACGCGCCTTCGCAACGGATACCGCGCAGAAGGTTTGGCCGTACTTCTCGGAG 
GAATCTTTAACACCTTCCCTTACACCGGATTTTCACAAAACGTTGGTTTGGTTAA^ 
CCTGCCAATCTACTACGCAGCTGGrn'CCTGGTTCTCCTrGGACTGCTTCCTAAGTrrGGCGCCCTTGC 

60 TTCCAAGCTCCGTCCTCGGTGGTGCCATGCTGGTAATGTTTG G l' rrr GTATCAATTCAAGGGATGCAAATCCTCGC 
CCGTGrrGACTTTGCTAACAATGAACACAACTTCCTTATCGCAGCTGTTTCAATCGCTGCAGGTGTCG 
AACAGTAATCrCTTTGTCAGCATGCCGACAGCCTTCCAAATGTTCTTCTCAAACGGAATCGTCGTAGCCAGCCT 
TCGCTATTGTCCTCAATGCCGTATTAAATCATAAAAAGAAATAA 

65 4136.2 
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ATGAAAGATAGAATAAAAGAATATTTACAAGACAAGGGAAAGGTGACTGrrAATGArrrGGCTCAGGC^^ 
AAAGACAGTrCCAAGGATrrrCGTGAGTTGATrAAAACCTTGTCCTTAATGGAAAGA^ 
AAGAAGATGGTAGTCTGACArTAGAAATTAAGAAAAAACATGAGATTACCCTCAAGGGGATrm 
AAAATGGCrrrGGCTrrGrrAGTCrCGAAGGCGAGGAGGACGACCrTTTTGTAG^ 
5 TATrGATGGTGATACCGTCGAGGTAGTGATTAAGAAAGTCGCTGACCGCAATAAGGGAACAGCAGCAGAAGCC^ 
AATTATTGATATCCTAGAACACAGTTTGACAACAGTTGTCGGGCAAATCGrrCTGGATCAGGAAAA^ 
GCTGGCTATATrCGrrCAAAAAATCAGAAAATCAGTCAACCGATTTATGTTAAGAAACCAGCCCTA^ 
GGAACAGAAGTTCTCAAAGTCrnATCGATAAATACCCAAGCAAGAAACATGATTrCTr^ 

ATGTAGTGGGACACTCAACGGATCTCGGAATTGATGTTCTTGAGGTCTTGGAATCAATGGACATTGTATCCGA^ 

10 TCCAGAAGCTGTTGTTAAGGAAGCAGAAAGTGTGCCTGATGCTCCGTCTCAAAAGGATATGGAAGGTCGTC^ 
TCTAAGAGATGAAATTACCTTTACCATTGACGGTGCGGATGCCAAGGACrrGGACGATGCAGTGCATA'^^ 
TCTGAAAAATGGCAATCrGGAGTTTCGGGTTCACATCGCAGATGTTTCTTATTATGTGACCGAGGGGTCT^ 
GACAAGGAAGCCCTTAACCGTGCGACTTCTGTrTACGTGACAGACCGAGTGGTGCCAATGCTTCCAGA^^ 
TCAAATGGCATCTGCTCTCTCAATCCCCAAGTTGACCGCCTGACCCAGTCTGCTATTAT 

15 GGAGATTGATAAACATGGTCGTGTGGTCAACTATACCATTACACAAACAGTTATCAAGACCAGTITrCCT^ 
TATAGCGATGTCAATGATATCCrAGCTGGCGATGAAGAAAAGAGAAAAGAATATCATAAAATTG]^ 
GAACTCATGGCCAAGCITCATGAAACrrTAGAAAACATGCGTGTGAAACGTGGAGCTCTCAATT^ 
AAGCGAAGATTTTAGTGGATAAACAAGGTAAGCCTGTTGATATCGTTCTTCGGCAGCGTGGTATTGC^ 
TGATTGAGTCTTTTATGTTGATGGCTAATGAAACAGTTGCCGAACArrrCAGC/^ 

20 CGAATTCACGAGGAGCCTAAGGCTGAAAAGGTTCAGAAGTITATTGArrATGCTTCGAGTm 

ATGGAACTGCCAGTGAGATrAGTCAGGAGGCACTTCAAGACATCATGCGTGCTGTTGAGGGAGAACCTTATGCA^ 
ATGTATTGTCCATGATGCTTCTTCGCTCTATGCAGCAGGCTCGTTATTCGGAGCACAATCACGGCCACTATGGACT 
AGCTGCTGACTATTATACTCACTTrACCAGTCCAATTCGTCGTTATCCAGACCTTCrrGT^^ 
ATTACGGCCGTrCTAAGGAAATAGCAGAGCATTTTGAACAAGTCATrCCAGAGATrGCGACCCAGTCrrC 

25 GTGAACCTCGTGCCATAGAAGCTGAGCGTGAAGTCGAAGCCATGA AAAA GGCTGAGTATATGGAAGAATACGTGG 
GTGAAGAGTATGATGCAGTTGTATCAAGTATTGTCAAATTCGGTCTCTTTGTCGAATTGCCAAACACAGTTGAAGG 
CTTGATTCACATCACTAATCTGCCTGAATTTTATCATrrCAATGAGCGTGATITGACT 

GGTATCACrrrCCGAGTGGGTCAGCAGATCCGTATCCGTGTTGAAAGAGCGGATAAAATGACTGGAGAGArrGAT 
TTTTCATTCGTACCTAGTGAGTTTGATGTGATTGAAAAAGGCTTGAAACAGTCTAGTCGT 
30 AGTGGCAGAGGGCGTGATTCAAATCGTCGTTCGGATAAGAAGGAAGA CAAGAG AAAATCAGGACGCTCAAATGA 
TAAGCGTAAGCATTCACAAAAAGACAAGAAGAAAAAAGGAAAGAAACCTTTTTACAAGGAAGTAGCTAAGA^ 
GAGCCAAGCATGGCAAAGGGCGAGGGAAAGGTCGTCGCACAAAATAA 

35 ATGGGCACAACAGGATTTACAATAATTGACTTAATTATOT 

CTATTTCTCTAAAAAAGAGATGAAAGGAAAAGAGTTCnTTAAAGGAGATGGTTCGGTT 
GTATCCATTrrrGCCACAATGCTCAGTCCGATTTCCrrCTTGGGACTCGCTGGTAGCTCT^ 
TTTATGGTrrGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATOT 
TAGACATCGATACGGCATATGArrACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTrCAGC^ 

40 GTTTATTATrrATCAATTGGGACGTATGTCTATCATTATGTACCTCCCL^TCAGCTGGTTTATCAGTA^^ 
TTGACATCAATATTTTGATrATTTrGATGGGTGTAGTTGCAATTGTTTATTCTT^ 
TTATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTA^ 

TAAAGGTGGCrrrGGTGCAGTAGCAG AAACATTAGCAAACGGG AAATTCCTTGCTGCAAATG AAAAAU^ i i 1 1 CG A 
TCCTAACTTGCTTrCAAACTCCATCTTTTrAATTGTGATGGCT^ 
45 TCAAGATTTGGTTCAACGTmACTACAACACAAAATATTAAGAAACTTAATAAGATGTTGTTCA 
TTGTCACTTGCAACTGCAACAGTCTTTTACTTGATTGGTACAGGCTTGTACGTATTC^^ 
ATAGTGCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTrATTGCATACCAGTTACCACT 
AGGTTTGATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATrTCAACAGGm 

TGGACATTGGATATTCAAGATGTCArrrCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTAT 
50 CTCTAGCAGTAGGTTTATTCTCAATTGGTGTrrCCATTGTCATGGCTCACTCAGATATTAAATCTGCATA 

TTCAATACTrrCATGGGACrrCTACTTGGTCTACTTGGTGGTGTATTTATTCTTGGAl'llO 1 1 CTAAAAAAGCAAA 
TAAACAAGGTGCTTATGCAGCGCTGATTGTATCAACCATCGTCATGGTATTTATTAAATACTTCCT^ 
GCTGTTAGCTACTGGGCATATTCArrGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGT^^ 
TGGAAATAAACTATCTGCACCTAAATATACAACGATrCATGATATTACAGAAArTAAAGCGGATTCAAGTTGGGA 

55 AGTTCGTCACTAA 
4138.1 

ATGAAATTTAGTAAAAAATATATAGCAGCTGGATCAGCTGTTATCGTATCCTTGAGTCTATGTGCCTATGCACTA^ 
ACCAGCATCGTTCGCAGGAAAATAAGGACAATAATCGTGTCTCTTATGTGGATGGCAGCCAGTCAAGTCAGAAAA 

60 GTGAAAACTTGACACCAGACCAGGTTAGCCAGAAAGAAGGAATTCAGGCTGAGCAAATTGTAATCAAA ATTAC AG 
ATCAGGGCTATGTAACGTCACACGGTGACCACTATCATTACTATAATGGGAAAGTTCCTTATGATGCCCTCTT^ 
TGAAGAACTCTTGATGAAGGATCCAAACTATCAACTTAAAGACGCTGATATTGTCAATGAAGTCAAGGGTGGTTA 
TATCATCAAGGTCGATGGAAAATATTATGTCTACCTGAAAGATGCAGCTCATGCTGATAATGTTCGAACTAAAGAT 
GAAATCAATCGTCAAAAACAAGAACATGTCAAAGATAATGAGAAGGTTAACTCTAATGTTGCTGTAGCAAGGTCT 

65 CAGGGACGATATACGACAAATGATGGTTATGTCTTTAATCCAGCTGATATTATCGAAGATACGGGTAATGCTTATA 
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TCGTTCCTCATGGAGGTCACTATCACTACATTCCCAAAAGCGATTTATCTGCTAGTGAATT^ 

ACATCTGGCTGGAAAAAATATGCAACCGAGTCAGTTAAGCTATTCTTCAACAGCTAGTC 

TGTAGCAAAAGGATCAACTAGCAAGCCAGCAAATAAATCTGAj^AA 

TTCACCTAGCGCCCAACGTTACAGTGAATCAGATGGCCTGGTCTTTGaCCCTGC TAAG ATTATCAGTCCT 
5 AATGGAGTrGCGATTCCGCATGGCGACCATrACCACTTTATTCCTTACAGCAAGCTTTCTGCC^ 
TTGCCAGAATGGTGCCTATCAGTGGAACrGGTTCTACAGTTTCTACAAATGCAAAACCTAATGA^ 
TCTAGGCAGTCTTTCAAGCAATC CrrcnCl i i AACGACAAGTAAGGAGCTCTCTTCAGCATCTGATGGTTATATTT 
TTAATCCAAAAGATATCGTTGAAGAAACGGCTACAGCTTATATTGTAAGACATGGTGATCATTTCCA 
AAAATCAAATCAAATTGGGCAACCGACTCTTCCAAACAATAGTCTAGCAACACCTTCTCC^ 
10 CCAGGAACTTCACATGAGAAACATGAAGAAGATGGATACGGATTTGATGCTAATCGTATTATCGCT^ 
TCAGGTTTTGTCATGAGTCACGGAGACCACAATCATTATTTCTTCAAGAAGGACTTGAC^ 
GTGCGCAAAAACATTTAG 

4139.1 

15 ATGAAAAAAAGAGCAATAGTGGCACTCATTGTACTGCTTTTGATTGGGCTGGATCAGTTGGTCAA^ 

TCCAGCAGATTCCACTGGGTGAAGTGCGCTCCTGGATCCCCAATTTCGTTAGCTTGACCTACCTGCAAA^ 

TGCAGCCTTTrCTATCTTACAAGATCAGCAGCTGTTATTCGCTGTCATTACTCTGGTTGTCGTGAT^ 

GGTATTTACATAAACACATGGAGGACTCATrCTGGATGGTCTTGGGTTTGACTCTAATAATC GCGGG 

aaactttarrgacagggtcagtcagggctttgttgtggatatgttcc^ 
20 atgtggcagatagctatctgacggttggagtgattattttattgattgcaatgctaaaagaggaaataaatggaa 

ATTAA 

4139,5 

ATGAATACAAATCTTGCAAGTTTTATCGTTGGACTGATCATCGATGAAAACGACCGTTmAC ^ 

25 ATGGTCAAACCTATGCTCTTGCTAAGGAAGAAGGCCAACATACACTAGGGGATACGGTCAAAGGTTTTGCATAC^ 
CGGATATGAAGCAAAAACTCCGCCTGACAACCTrAGAACTGACTGCCACTCAGGACCAATTTGGTTGGGGAC^ 
TCACAGAGGTTCGTAAGGACTTGGGTGTCTTTGTGGATACAGGCCTTCCTGACAAGGAAATCGTTGTOTCACT 
TATTCTCCCTGAGCTCAAGGAACTCTGGCCTAAGAAGGGCGACCAACTCTACATCCGTCTTGAAGTGGATA^ 
AGACCGTATCTGGGGCCTCTTGGCTTATCAAGAAGACnTCCAACGTCTTGCTCGTCCTGCCTAC^ 

30 AACCAAAACTGGCCAGCCATTGTTTACCGTCTCAAGCTGTCAGGAACTTTTGTTTACCT^ 

TrGGTTTTATTCATCCTAGCGAGCGTTACGCAGAGCCACGTTTGGGCCAAGTATTAGATGCGCGCGTTATTGGm 
CCGTGAAGTGGACCGCACTCTGAACCTCTCCCTCAAACCACGCTCCTTTGAAATGTTGGAAAACGATGCT 
ArrrTGACTTATrrGGAAAGCAATGGCGGTITCATGACCTTAAATGACAAGTCATCTCCAGACGACATC^ 
CCTTTGGCATTTCTAAAGGTCAGTTCAAGAAAGCTTTAGGTGGTCTTATGAAGGCTGGTAAj^ 

35 GTTrGGGACAGAGTTGATTTAG 

4139.8 

ATGAAAGATGTTAGTCTATTTTTATTGAAAAAAGTTTTCAAAAGCCGCTTA/^ 

ATCTGTACTCGGTGTTAC Li i i i ATTTAAATAGTCAGACTGCAAACTCACACAGCTTGGAGAGCAGGTTGGAAAGT 
40 CGCATTGCAGCCAACGAGAGGGCTATCAATGAAAATGAAGA GAAA CTCTCCCAAATGTCTGATACCAGC TCGGA G 
GAATACCAGrrTGCTAAAAATAATTTAGACGTGCAAAAAAATCrrTTGACGCGA^^ 

TAAAAGAAGGGCGCTCGAAAGAAGCCTACTATrrGCAGTGGCAAGATGAAGAGAAGAATTATGAATTTGTATCAA 
ATGACCCGACTGCTAGCCCTGGCTTAAAAATGGGGGTTGACCGCGAACGGAAGATTTAC CAAGC CCTGTATCCCT 
TGAACATAAAAGCACATACTTTGGAGTTTCCGACCCACGGGATTGATCAGATTGTCTGGATTn-AGAGGT^ 

45 CCCAAGTTTGTTTGTGGTTGCTATTATTTTrATGCrAACACAACTATTT 

ACAGCTCACTTATATCCTGTrrCAAAAGTGACATTTGCAATATCCTCTCTTGGAGTTGGAGTG^ 
TGCTGTTTATCGGAATCTGTGGCTTTTCnTrrCTAGTGGGAAGTCTGATAAGTGGTT^ 
TACCCAATTTATAGCTTAGTGAATCAAGAAGTAACTATTGGGAAAATACAAGATGTATTATTTCCTGGCT^ 
TAGCrrrCTTAGCCTTTATCGTCATrGTGGAAGTTGTGTACTTGATTGCTTACT^^ 

50 CT Cl Tlcn 1 CACTCATTGGGATTGTTGGCTTATTGTTTGGTATCCAAACCATTCAGCCTCTTCAAAGGATTGCACA 

TCTGATTCCCTTTACTTACTTGCGrrCAGTGGAGATTTTATCTGGAAGATTACCTAAGCAGAT^^ 
TAAATTGGAGCATGGGAATGGTCTTACITCCnTGCCTGATTATC !■ r r I IGCTATTGGGAA 
TGGGGAAGTTCACAGAAAAAAGAATTTTTTAATAGATTCTAG 

55 4141.1 

ATGATGAAGTTCATATTGGATATTGTTAGTACACCAGCTATTTTAGTAGCTTTAATTGCAATCTTAGGA^ 
TCAGAAGAAGAAATTACCTGATATTATTAAAGGTGGAATTAAGACCTTTGTTGG' i ' ri L ' i 1 AGTTGTATCTGGTGGT 
GCAGGAATTGTACAAAATTCTTTAAATCCATTTGGTACCATGTTTGAGCATGCTTTTCATT^ 

GAATAATGAAGCAATTGTAGCTGTAGCTTTAACAACATATGGCTCAGCTACTGCAATGATTATGTTTGCAGGCA^^ 
60 GTGTTCAATATCTTAATCGCTCGTTrrACTCGATTTAAATATATTTTTrTA^ 

ATGTATGATTGCGGTCATTTTATCAGTTGCTGGCrrTACTAGCTTGCCTCTCATCT^ 

GTATTATTATGAGTATTTCCCCAGCATTTGTGCAAAAATATATGGTTCAATTAACTGGAAATGACAAGG^ 
AGGTCATTTCAGTTCTTTGGGATATTGGTTGAGTGCjI 11 lACTGGTAGCCTTATCGGTGACAAATCAAAATCAACA 
GAGGACATTAAATTTCCAAAGAGTTTAGCl' 1 I i l i ACGTGATAGTACTGTTAGTATTACTTTATCCATGGCAGTTAT 
65 TTACATTATTGTAGCTATCTTTGCAGGGTCAGAATATATAGAAAAAGAAATCAGTAGTGGTACAAGTGGTCTAGTT 
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TATGCmACAATTAGCAGGTCAATTTGCAGCAGGGGTATTTGTrATTTrAGCAGGTG^^ 
AATTGrrCCAGCCTrrAAAGGTATrTCAGAGCGTCrrGTACCTAATTCAAAACCrG^^ 
ATACTTATGCACCCAATGCAGTTCTAATTGGATTTATCTCTAGTTTTGTrGGTGGTT^ 
ATrGCTTCAGGAACGGTTGTTATCTTACCAGGTGrrGTGCCTCATTTCTTCTGT^^ 
5 GAATGCATCTGGTGGTGTrCGTGGAGCCACTATTGGAGCATnTTACAAGGTATrrrAi^^ 
rrrTAATGCCAGTTTTGGGAGGACTTGGTTTCCAAGGATCAACrrrCTCAGATO 

ArmAGGAATGTTAAATCAATTTGGCTCAC>u4.GCAGGCATTGTGATrGGTCriG I'iCT rATTCTAGCAGTTATGTT 
TGGAGTATCCnTATTAAAAAGCCATCTGCAACGGAGGAATAA 

10 4142 3 

ATGATTAAAACATTTCTCTCTGCCCrrTCGGTCATTCTCTmCTATCCCTA 

TCTAATOTAACATrTGGCTATCrACCCAACCTATCTTGGCACAGATTTATGCCTrCC^ 

GGCrGCTATTITAAGnTCTT Ai Hi Hi I CCTATCTTTrrACAAGAAAAATAAACAAATACGGTTITACTCTGGCA 
TTTrGCTCTTACTATCGCTCATATTACTATTATTCGGAACAGATAAAACCCm 
15 ACCTTAAAATTAGTAACrrGGAACGTCGCTAATCAAATAGAAGCACAACATATrGAGCGAATT^ 

ACGCCGATATGGCTATATTCCCTGAACTAGCTACCAAT ATCAG AGGTGAGCAAGAAAACCAGAGAATCAAACTAT 

TGTTTCATCAAGTTGGACTrrCTATGGCCAACTATGATATITrCACTrCT CCACCT ^ 
GTGACTGTGATTGTCAAGAAAAGTTATGGTTTCTATACAGAAGCTAAAACTTTTCAT^ 

TrGTATTACATTCGAGAAAACAAAATATACCAGATATCATTGCCTTGCATACTGCGCCTCCTCTGCCAGGm 
20 GGAAATCTGGAAGCAAGACTTAAACATCATrCATAATCAATTGGCTTCAAAATATCCAAAGG^^ 

GATTrrAATGCAACTATGCGTCATGGAGCACTTGCAAAAATAAGCTCTCATAGGGACGCATTAA ATGC ACTG 

CCrmGAAAGAGGAACXTGCAATAGCCAAAGTCCAAAACrrTTTAATGCAACAATAGATCA-^^ 

AAAACCACTACTATGrrAAAGATTTAGACATrGTAAGTTTTCAAAACTCTGATCATA^ 

CACATnTAA 

25 

4142.4 

ATGAATCCAATCCAAAGATCTTGGGCTTATCTCAGCAGAAAGCGACTGAGAAGTTrTATTTTAm 

TGGTCrrATTGGCCGGAATTTCAGCCTGTTTGACTCTGATGAAGTCCAACAAAACAGTAGAAAGC^^ 
ATCACTCAATACATCTTTrrCTATTAAGAAGATAGAGAATGGTCAGACArrCAACTTGTCAGACCT^ 
30 aGCAAGATTAAGGGGCTGGAAAATGTCTCTCCTGAACITGAGACGGTCGCAAAACTAAAj\GACAAGGAAGCACT 
ACTGGCGAGCAGAGCGTGGAGCGTGATGATrrATCAGCTGCAGACAATAACTTGGTTAGCTTAACGGCTC^ 
GATTCATCCAAGGATGTAACCTTTACCAGTTCGGCrrTTCAATCTAA AAGA AGGGCGACACCTTC 
TCCAAGAAAATCCTTATCCACGAAGAATrGGCTAAGAAGAACGGTCTTTCGarrC ATGAC AAGArrGG 
CTGGTCAGTCTGAATCTGGAAAAGGAGAAACAGTAGAGTn'GAGAlTATCGGCATCTTTTCTGGTA^ 

35 AGAAATTCACAGGCTTGTCTTCTGACnTCAGTGAAAATCAAGTCTrTACAG 

GGGCAATAGTGAAGCTCAAGTCAGTGCAGCACGCTTCTATGTAGAAAATCCTAAGGAAA TGGACG GACTCATGAA 
GCAGGTAGAAAACTTGGCCTTGGAAAATCAAGGCTACCAAGTCCAAAAGGAAAACAAGGCl I I IGAACAAATCAA 
AGACrCAGTTGCAACTTTCCAAACCTTCCTGACCATCTTCCTTTATGGGATGTTGATAGCAGGAGCT^ 
ATTCrGGTTTTGTCTCTCTGGTTGAGAGAACGGGTCTATGAAGTGGGGATrrrACTTG 

40 GCTCGATCTrCCTACAATTCTGTrTAGAGGTAGTTTrGGTATCTCTTGGAGCTTTGC^ 

AACGCAATCACAACrrACCTACTCCAAACTCTACTAGCAAGTGGAGATCAGGCAAGCTTACAAGATACACTAGCC 

AAAGCAAGCAGTTTATCAACTAGCATCTTATCTTTTGCAGAATCCT ATGTT T^ 

TGTAGCCCTTTGTTTCCTATTCTTATTrAGAAAATCACCGAAAGAAATTTTATCATCTAT^ 

45 4142.5 

ATGTTACACAACGCATITGCCTATGTTACAAGGAAGTTmCAAATCGATTGTCATCTrCCTGA TT^ 
GGCGAGCTTGAGTTTGGTCGGCTrGTCAATCAAGGGAGCTACTGCCAAGGCTTCTCAGGAGACCTTrA.^^ 
ACCAATAGCTTCTCCATGCAAATCAATCGTCGCGTCAACCAAGGAACGCCTCGTGGTGCTGGGAATATCAAGGGT 
GAAGACATCAAAAAAATCACCGAAAACAAGGCCATTGAGTCTTATGTCAAACGTATCAACGCTAT CGGA GATTTG 
' 50 ACTGGATATGACCTGATTGAAACGCCAGAAACCAAGAAGAATCTCACTGCTGATCGTGCCAAGCGTTTTGGAAGT 
AGCTTGATGATTACAGGTGTCAATGACTCCTCTAAAGAAGACAAGTTTGTCTCTGGrrCTTATAAACTAGTra 
GAGACCACTTAACCAACGACGACAAGGATAAAATCCTCTTGCACAAGGACTTGGCAGCCAAACACGGCTGGAAA 
GTAGGGGACAAGGTTAAACTGGACTCTAATATCTACGATGCAGATAATGAAAAAGGAGCC AAGGA AACAGTTGA 
AGTGACAATCAAGGGACTCTTTGATGGTCATAATAAGTCAGCAGTAACCTACTCACAAGAACTTTACGAA^ 
55 AGCTATTACAGACATTCACACTGCTGCAAAACTTTATGGATACACAGAAGACACAGCCATTTATGGGGACGCAAC 
CTTCTTTGTAACAGCAGACAAGAACTTGGATGATGTTATGAAAGAGTTGAATGGCATCAGTGGTATCAACTGGAA 
GAGCTACACACTCGTCAAGAGCTCCTCTAACTACCCAGCTCTTGAGCAATCTATCTCTGGTATGTACAAGATGGCC 
AACCTCCTCTTCTGGGGTAGCTTGAGCTTCTCAGTTCTCCTCCTTGCCCTCTTGCTCAG^ 
TCGCAAGGAAGTGGGAATTCTCCTCTCTATCGGCCTCAAGCAGGCAAGTATCTTGGGTCA 
60 ATTCATCACCGAATCTATCrrGATTGCTATCCCTGCTCrAGrrrCTGCTTACTTCCTAGC^ 

CAATTGGAAACACTGTCCTTGCCAATGTGACTTCAGGTGTTCCCAAACAGGCTAGTAAGGCGGCTCAACCCT 
ACCTTGGTGGTGGTGCAGAAGTAGATGGCTTTAGCAAGACCTTGTCGAGCCTAGACATTTCCATTCAGACATCAGA 
CTTTATCATCATTTTTGTCCTTGCCTTGGTTCTAGTGGTTCTCGTTATGGCGCTTGCTTC^ 
AACAACCAAAAGAGCTCTTGCTGGATGGTGAATAA 
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ATGTCACAGGATAAACA-^ATGAAAGCTGTrrCTCCCCnTCrGCAGCGAGrrATCAAT^^^ 
GGGTTGGGAGTrrGATTITCTGTATTrGGGCTTATCAGGa^GGATm 
ATCCAGCAGGCAGGCATCrGGGGTCCACCTCTCTTTATCTTnTACAGArrrrACAGACT 
5 AGGGGCCrrGACCTCGGTGGCTGGGCTCTTTATCTACGGGCACATCATCGGGACrATCTAC^ 
GTGATTGGCTGTGCCATTATCTTrTATCTAGTGCGCCTATACGGA^ 
GCACCTACGACAAGTACATCGACTGGCTAGATAAGGGCAATCGTTrTGACCGCrrCTTT 
GCCCATTAGCCCAGCTGACTn*CTCTGTATGCTGGCTGCCCTGACCAAGATGAGCTTCAAG(X^ 
ATCATTCTGACCAAACCCTITACCCTCGTGGTrrATACCTACGGTCTGACCTATATTATTGAC^^ 

10 GCTTTGA 

Itcagaaatatgtgggttgtaatcaaggaaacctatcttcgacatgtc^ 

rrrCGCCGTrCCTCrrmAGGAATCTCTGTAGGAATTGGGCATCTCCAAGGrrCTrC^^ 
15 GTGGCAGTAGTGACAACAGTGCCATCTGTAGCAGAAGGACTGAAGAATGTAAATGGTGTrAACTTCGACTA^ 
GACGAAGCAAGTGCCAAAGAAGCAATrAAAGAAGAAAAATTAAAAGGTTArrrGACC^ 
GTTCrAAAGGCAGrrrATCATGGCGAAACATCGCrrGAAAATGGAArrAAATTrGAGGTTACAGCT^ 
AACrGCAAAATCAGCTTAATCGTTCAACTGCrrCCTTGTCTCAAGAGCAGGAAAAACGC^ 
AATTCACAGAAAAGATrGATGAAGCCAAGGAAAATAAAAAGTTTATTCAAACAATTGCAGCAGG^^ 
20 TCTTTCrrrATATGATrCTGATTACCTATGCGGGTGTAACAGCTCAGGAAGTrGCCAGTG/^^ 

TATGGAAGTCGTTTTITCTAGCATAAGGGCAAGTCACTATTrCTATGCGCGGATGATGGCTCTGT^ 
TAACGCATATTGGGATCTATGTTGTAGGTGGTCTGGCTGCCGTTTTGCTCTTTAAAGArrr^ 
TCTGGTATmGGATCACrrGGGAGATGCTATCTCACTGAATACCTTGCTCTTTATm 
CGTAGTCTTGGCAGCCTTCCTAGGATCTATGGTTTCTCGTCCTGAGGACTCAGGGAAAGCCTrGTCGCC^ 
25 ATrrrGATTATGGCTGGTTTTmGGAGTGACAGCTCTAGGTGCAGCTGGTGACAATCTCCTC^ 
TTATATTCCCTITATTTCGACCTrCTITATGCCGrrrCGAACGATTAATGACTATG 
ATrrCACrrGCTATTACAGTGATTmGCGGTCCTAGCAACAGGATTTATCGGACGCATG^ 
TCAAACGGATGATTTAGGGATTTGGAAAACCTrTAAACGTGCCTTATCTTATAAATAG 

30 4144 3 

ATGACAGAAACCATTAAATTGATGAAGGCTCATACrrCAGTGCGCAGGrTTAAAGAGCAAGAAATO 
GACTTAAATGAGATTrrGACAGCAGCCCAGATGGCATCATCTTGGAAGAATrrCCAATCCT^ 
TACGAAGTCAAGAGAAGAAAGATGCCrrGTATGAATTGGTACCTCAAGAAGCCATrCGCCAGTCTGCTGT^ 
TCrCrrrGTCGGAGATrTGAACCGAGCAGAAAAGGGAGCCCGACrrCATACCGACACCTTCCAACCCCA^^ 
35 GGAAGGTCTCTTGATTAGTTCGGTCGATGCAGCTClTGCTGGACAAAACGCCTrGTTGGCAGCTGAAA^^ 
TATGGTGGTGTGATTATCCGTTTGGTTCGATACAAGTCTGAAGAAGTGGCAGAGCTCXrTAACCr^ 
CCTATTCTGTCrrrGGGATGGCACTCGGTGTGCCAAATCAACATCATGATATGAAACCGAGACTGCCAC^ 
ATGTTGTCrrrGAGGAAGAATACCAAGAACAGTCAACTGAGGCAATCCAAGCTTATGACCGTGTrCAGGCT^^ 
ATGCTGGGGCGCGTGCGACCACAAGCTGGAGTCAGCGCCTAGCAGAACAGrrTGGTCAAGCTGAACCAAGCTCAA 

40 CTAGAAAAAATCTTGAACAGAAGAAATTATTGTAG 
4146 1 

ATGTrAAAACTTATTGCTATrGTrGGAACAAATTCAAAACGTrCTACAAACCGTCAArrGCT^ 
AACACriTACTGACAAAGCTGAAATTGAACTTGTrGAAATCAAGGCCATTCCTGTC^ 

45 GCAAGTACCTGCTGAAATATTGGAAATTGCTGCTAAAATCGAAGAGGCAGATGGCGTTATTATCGG^^ 
GTATGATCACTCTATTCCAGCTGrrrrGATGAGCGCTCnTGCTTGGrrGTCrrATGGTATT^ 
AACCAATCATGATTACAGGTGOTCTTACGGTACGCITGGTrCATCTCGTGCCCAATTGCAACTTC^ 
GAATGCTCCTGAAATCAAGGCAAATGTTCTTCCAGATGAArrCTTGCTCTCACACTCTCTTCAA^ 
AGTGGCGACrrGGTrGACCTTGATGTTATCAAGAAATTGGATGCCATCTTTGATGACTTCCGTATCT^ 

50 TCACAGAAAAATTACGTAATGCACAAGAATTACTTCGCAAAGATGCTGAAGACTTrGACTGGGAAAATT^ 

4146 2 

ATGAATACCTATCAATTAAATAATGGAGTAGAAATTCCAGTATTGGGATTTGGAACm^ 

GAAGAAGCCrATCGTGCAGTGTrAGAAGCCTrGAAGGCTGGTTATCGTCATATrGATACGGCGGCGATITATCA^ 
55 ATGAAGAAAGTGrTGGTCAAGCAATCAAAGATAGCGGAGTrCCACGTGAAGAAATGTTCGTAACTACCAAGCi 1 1 

GGAATAGTCAGCAAACCTATGAGCAAACTCGTCAAGCTTTGGAAAAATCTATAGAAAAACTGGGCT^ 
TGGATTTGTATrrGATTCATTGGCCGAACCCAAAACCGCTCAGAGAAAATGACGCATGGAAAACT^^ 
AAGTTTGG AGAGCGATGGAAGACCTCTATCAAG AAGGGAAAATCCGTGCTATCGGCGTTAGCAA i l 1 1 u 1 1 CCCC 
ATCATTTGGATGCCTTGCTTGAAACTGCAACTATCGTTCCTGCGGTCAATCAAGTTCGC TrGGCG CCAGGT 
60 TCAAG ATCAAGTCGTAGCrTACTGTCGTGAAAAGGGAATTTTATTGGAAGCTTGGGGGCUi i i l GGACAAGGAGA 

ACTGTTTGATAGCAAGCAAGTCCAAGAAATAGCAGCAAATCACGGAAAATCGGTTGCrCAGATAGCCTFGGC^ 
GAGCTTGGCAGAAGGATTTrTACCACrrCCAAAATCTGTCACAACCTCTCGTATTCAAGCTAATCTTG 
GGAATTGAACTGAGTCATGAGGAGAGAGAAACCTTAAAAACGATTGCTGTTCAATCGGGTGCTCCACGAGTTGAT 

GATGTGGATTTCTAG 

65 



45 



4147 1 — — . 
ATGAGGTGCA>UW^TGCTTGATCCAATTGCTATTCAACTAGGACCCCTAGCCATTCGTTG 

TGACAGGCrrGATrCTrGCGGTrrATTTGACCATGAAAGAAGCACCTAGAAAGAAGATr^ 

AGATrrTATCTTAGTAGCCrrrCCCTrGGCTATTlTAGGAGCTCGTCTCTACT^ 

5 TAGTCAGAATTTAGGAGAGATTTrTGCCATTrGGAATGGTGGTTTGGCCATTTACGGTC 

CTTGTGCrCTATATCTTTGCTGACCGTAAACTCATCAATACrrGGGATTTTCT - 

GATTOCTCAAAGTrrGGGCCGTTGGGGTAATrrCTTTAACCAAGAAGCTTAT^ 

TATCTACCTGGCTTTATCCGTGACCAGATGTATATTGAGGGGAGCTACCGTCAACCGACrm 

TATGGAATCTGCTTGGCTTTGCCTTGATTCTGATTrrrAGACGGAAATGGAAGAGTCT 

10 GGCCrmACTTGATTTGGTATGGTTTCGGTCGTATGGTTATCGAAGGTATGCGAACAGATAGTCr^^ 

GCrrrCGAGTGTCCCAATGGCTGTCAGTTGTCCTTATCGGTCTCGGTATAATGATCGTTATT^ 

GAAGGCCCCTTACTATATTACAGAGGAGGAAAACTAA 

4147 2 

15 ATGGGTAAATTATCCTCAATCCnTTTAGGAACCGTTTCAGGTGCAGCrCTTGCCTTGTT^ 

GCAAACAAGTTTGCAGTCAGGCTCAAGATTTTCTAGATGA TTTGA GAGAAGATCCGGAGTATGCCAAGGA 
TCTGTGAAAAACTGACAGAAGTTAAGGAGCAGGCTACAGATTTT GTTCTG AAAACAAAAGAACAGGT^ 
GTGAAATCACTGTGGACAGTATACTTGCTCAAACTAAATCCTATGCTrrrCAAGCGACAGAAGC^^ 
AATTAAATAATCTCAAGGAGCAATGGCAAGAAAAAGCCGAAGCrCTTGATGACrCAGAAC^ 
20 TAACAGAAGAATAA 

4147 3 

ATGAAAACTAAATrGATCTTTTGGGGCTCTATGCTCTTrCTCCrCTCCCTCTC 

TTGGATTTTCTATCCTATGGAGATTCAGTGGCTAAACTTAACGAATCGAGTCTATCTAAAACCAGA^ 
25 TACAATirrCATATCTrGATGAATTATCTGACCAATCCrrrrAGTCAGGTCTr 

AGCAGCTGGTCTGCACCATTTCGCAGTGGTCAAGAATCTCTTTCATTTGGTTCAGCTAGT 

CCAAGTITCTATGTCTTTGTCAATAGGATrGTGAAAAAGGACrri TrGT CT 

AGTAGTCTTACCTGTGATGATTGGACTrGGGGGAGrrrrGATTGGTTTTGACCAATrcnTr^ 

TTCTCTrrGTGGGAGATGATACCTGGOTTTTGATCCAGCCAAGGA^ 
30 TTTCrrCATGCCTrCCrCCT CI ' i ' i ' rri 'GCCCTCTATGAAAACi ICl I IGGCTATCTGTATCTCAAAAGTCGTAGGAA 

GTGA 

4149.1 

ATGACTTATCATTTTACTGAAGAATACGATATTATTGTAATrGGTGCGGGACACGCT GGGG TTGAGGCTTC 

35 CCGCTAGCCGTATGGGCTGTAAGGTCCTGCrTGCGACCATCAATATTGAAATGCrGGCTTTCATGCC^ 

CTCTATCGGTGGTTCTGCCAAGGGGATTGTCGTGCGTGAAGTCGATGCCCTCGGTGGCGAGATGGCCAAAACCATT 
GACAAGACTTACATCCAGATGAAGATGCTAAACACAGGGAAGGGGCCAGCTGTCCGTGCCCTrCGTGCGCAGGCT 
GACAAGGAACrrrACTCTAAGGAGATGCGCAAGACGGTTGAAAACCAAGAAAATCTGACCCTTCG 
ATTGATGAGATTTTGGTGGAAGATGGCAAGGTTGTCGGTGTGCGTACAGCCACCCATCAAGAATATGCTCCTAAG 

40 GCTGTTATTGTGACGACAGGGACTGCTCTCCGTGGGGAAATTATCATCGGAGACCrCAAGTAC TCAT CAGGTCCTA 
ACCACAGCTTGGCTTCTATTAACCTAGCTGACAATCTCAAGGAACTGGGTCTCGAAATCGGTCG^ 
AACCCCTCCACGTGTCAAGGCTTCTTCTATCAATTACGATGTGACAGAAATTCAGCCAGGAGACGAAGTGCCTAAT 
CATTTCTCATACACTTCACGTGATGAGGATTATGTCAAGGACCAAGTACCATGCTGGTTGACCTATACCAATGGTA 
CCAGTCATGAGATTATCCAAAACAACCTCCACCGTGCGCCTATGTTTACAGGTGTGGTCAAGGGAGTGGGGCCTC 

45 GTTACTGTCCGTCGATTGAAGACAAGATTGTGCGCrn'GCGGACAAGGAACGTCACCAACTCTTCC^ 

AGGGCGCAATACTGAGGAAGTCrrATGTGCAAGGACTITCAACCAGTCTGCCrGAGGATGTCCAGCGTGACTTGGT 
GCATTCCATCAAAGGTTTGGAAAATGCAGAGATGATGCGGACAGGTTATGCTATTGA 
GTATGATATGGTCTTGCCTCATCAGTTGCGTGCGACriTGGAAACCAAGAAAATCTCAGGTCTCTTC^ 
CAGACAAATGGAACATCAGGTTACGAAGAGGCAGCACGCCAAGGGATTATCGCGGGTATCAATGCGGCTCTGAA 

50 AATCCAAGGCAAGCCTGAATTGATTTTGAAGCGCAGTGATGGTTATATCGGGGTGATGATCGACGACTTGGTGAC 
CAAGGGAACCATTGAACCCTACCGTCTCrrGACCAGTCGTGCTGAATACCGTCTCArrCTTC GTCAT GACAATGCT 
GATATGCGCTTGACTGAGATGGGACGCGAGATTGGCCTTGTGGACGATGAACGCTGGGCTCGTTTTGAAATO 
AAAAATCAATTTGATAATGAGATGAAGCGCCTAGACAGTATCAAACTCAAGCCAGTCAAGGAAACCAATGCCAAG 
GTTGAGGAGATGGGCTTCAAACCCTTGACCGATGCAGTGACAGCCAAGGAATTCCTTCGCCGTCCAGAAGi I lui i 

55 ACCAAGATGTGGTGGCCTTCATCGGACCAGCTGCAGAAGACTTGGATGACAAGATrATCGAATTGATTGAAACAG 
AAATCAAGTATGAAGGCTATATTTCCAAAGCCATGGACCAGGTTGCCAAGATGAAACGCATGGAAGAAAAACGCA 
TTCCGGCCAATATCGACTGGGATGACATTGATTCTATCGCAACCGAAGCCCGTCAGAAGTTCAAACTCATCAATCC 

agaaaccatcggccaagccagccgtatttccggagtaaacccagcagatatttctattttgatggtgtatctggaa 
ggtaaaaatcgtagtatttctaaaactcttcaaaaatcaaaatga 

60 



4149.2 

ATGAAAGTATTAGCrrrrGATACGTCCAGCAAGGCTCTTTCTCTGGCTATTTTAGAGGATA^ 
65 AGACGACGATTAATATTAAGAAAAATCACAGTArrACTCTTATGCCTGCCATCGAl 1 T 'f T TG ATGGCAAGTTTGG A 
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rrGGACACCCAAGGATTTGGACCGAATCCrrGGTAGCTGAAGGGCCGGGTAGCTATACAGGCnTCCGAATTG 
AGCAACTGCTAAGACCnTAGCTCACACCCTGAACATCGAGrrGGTTGGTATCTCGAGTCr^ 
CATCAACAAGAAG G i ' l ' lU I ' l i GTCCCCTrGATGGATGCGCGTCGCAATAATGriTATGCAGGATTTTATGAAAATG 

ccaaacctgtcatggcagaagcgcacctatcrittgaagaggtgctagaaaaagtcaagggtactagtcagg^ 
5 cgtttgtcggagaagttggcccctttgttgagcagattcaaaaacacttgccaaggact^ 

gcccaatgcagctaatctagctcttttggcctgggacaaggaagcagactccrrgcatgatrrrct^ 
ctcaaacgagtcgaggctgaggaaaactggctcaagaaccataccgagtctggcgagtcrracarraaacgccta 

TGA 



ATGATAGAAATCAAGCGAATTCAACAACAGCCTGACCTAGCTCAAGCCATCTACGCTGTTATGGCAGCTGTTTAC^ 
TAGTCAGTCCTTGGACTCTGGAGCAAATCCAAGCAGATCTGTCCCAAGACCAGACT^ 
TGGGGCAGAAGTGATTGGATrTCTAGCTGTGCAGGAGAATCTTTTTGAAGCAGAAGTCCTGCAA ATCGC T 
AGGAGCTTATCAGGGTCAGGGGATTGCGTCagCCTrGTTrGCTCAATTGCCGACAGACAAGGAAATm 
15 GTCAGACAGTCAAATCAACGAGCGCAAGCATTTTACAAGAAAGAAAAGATGACAGTTATCGCTGAGCGAAAGGC 
CTACTACCATGACCCAGTCGAGGACGCCATTATCATGAAGAGAGAAATAGATGAAGGATAG 



**LJ^.^ 

ATGACAAAACAAGTCTTATTAGTGGATGATGAAGAACACArrCTGAAATTGCrTGACTACCA T^ 
20 GGCrmCrACTCAATTGGTGACAAATGGACGGAAGGCCTTACCTTTGGCAGAAACAGAACCC^ 

TGCTTGATATCATGTrACCACAATTAGATGGCATGGAAGTTTCTAAGCGGCTGAGAGCCAAAGGCGTCAAAACT^ 
CAATTATGATGGTTTCTGCGAAAAGTGATGAATTTGATAAGGTTTTGGCCTTGGAA^^ 

GACCAAGCCTTTTAGCCCTAGAGAATTGCTGGCGCGTGTCAAGGCTGTCCTCAGGCGAACTAAAGGAGAACAAGA 
AGGAGATGATTCAGATAATATCGCTGACGATTCrrGGCTATTTGGGACCTTGAAAGTATACCC TGAGCGT CATGAA 
25 GTCTACAAGGCGAATAAGTTACTGAGTTTGACCCCAAAAGAATTrGAAAGCGATAAAAATCCGTT^^ 
TCAAAGTfrCGAAAGTAACCGCCCAATAA 

4154.1 

ATGACrACrTTTAAAGATGGATTTTTATGGGGTGGTGCTGTTGCTGCTCATCAACT^ 
30 GTGGCAAGGGAATTAGTGTTGCTGATGTTATGACTGCTGCTCGTCATGGAGTAGCTCGTGAAATTACTTr GGG 

TTTAGAGGGTAAATATTATCCAAATCATGAGGCGATAGATTTTTATCACCGTTATAAAGAAGATATAGCAUi 1 1 1 i 
GCTGAAATGGGATTCAAGTGCTTCCGTACCTCTArrGCATGGACACGTATCITTCCAAAAGGTGATG 
CGAATGAAGAAGGATTACAGTTTTATGATAATCrrnTGATGAATGCTTAAAGAATGGTATrG^^ 
TCTATCTCATTTTGAAATGCCTTATCACTTAGTGACCGAATATGGTGGTTGGAAAAATAGG AAAT TGA^^ 
3 5 TTTGCTCGTTTTGC AG AAGTCGTATTTAAACGTr ACAAAG ATAAGGTT AAATATTGG ATC 

ATAATCAAGCGAATTATCAGGAAGATTTTGCACCATTTACTAACTCAGGTATTGTATATGAGGAAGGTGATAATAG 
AGAAGCAATTATGTATCAAGCAGCACATTACGAATTAGTTGCTTCTGCACGAGCTGTAAAAArrGGTCATGAGAT^ 
AATCCAGATTTTCAAATAGGTTGTATGATTGCGATGTGTCCAATTTATCCAGTTACTTGCAATCCT 
AATGGCAATGAAAGCTATGCAGAAGCGTTATTATTTrGCTGATGTGCAT 
40 TTCAAGTATTGGGAACGAAAAGGTATrrCAGTTGATrTTACTGCCCAGGATAAAGAAGATTTA^ 

TAGATTACATTGGTTTCAGTTACTATATGTCCTITGCTATCGACTCTCATCGTGAAAATAATCCTTAT^ 
CTTGAAACAGAAGATTTAGTGAAAAATAATTATGTTAAGGCTTCTGAATGGGAGTCGCAAATT GATC^ 
TTGCGTTATGCGTTAAATTGGTTTACAGACCACTATCACTTACCACTCTTTATTGTTGAA^ 

AGATCAAGTTGCAGCAGATGGTATGGTACATGATGATTATAGAATTGAATATCTAGGTGCCCATATTCGTGAAATG 
45 AAAAAGGCTGTAGTTGAAGATGGTGTTGATTTAATGGGTTATACTCCATGGGGATGTATTGATTTGGTT^ 

GTACCGGTGAAATGCGGAAACGTrATGGCTrTATTTATGTAGATAAAGATGATAATGGGAAGGGAAGTTATA^ 
GTTCCCCGAAAAAATCTrrTGCCTGGTATAAGGAAGTTATTTCATCTAACGGTGAATCAGTAGAATAG 

4154.2 

50 ATGGATCAACAAAACGGGriGi'i'iGGi 1 i iCi IGAAAACCATGTTaTGGGACCAATGGGCAAAC TTGCTC AGTTTA 

AAGTAGTACGTGCTATCACGGCTGCAGGTATGGCTGCTGTACCATTTACTATTGTAGGATCAATGTTm 
CAGTATTTTGCCACAAGCTTTCTCATTTTGGCCAATTGTGGCAGATATTTTCTC^^ 
CACmACATGGTTGCAAACTATGCGACTATGGGTTCTCTATCTCTTTATTTCGTTCT^ 

ACAAAAATTT ATGCAG AGG AAG AAG AACTCAATATG AATCCTCTTAATGGTGCCTTGCTTGCCTTG ATGGc^ i i ii G 
55 TCATGACAGTACCGCAAATCA'iU'riUGATGGTGGAATGATGAAGACTGTGACAAGTCTAAAAGAAGGTGCAGTAA 
TTGCAGATGGATGGGCAATGGGAAATGTAGTCGCACGTTTTGGGACAACAGGGA l - i ' i - l 1 ACCGCAATCATTATGG 
CAATTGTGACTGTTCTTATTTATCGTATGTGTGrrAAACATAATTGGGTTATTAAAATGCCTGAAGCT 
GGAGTTTCTCGTGGATTTACCGCTTTGGTTCCGGGATTTGTTGTTGCATTTGTTGT^^ 

GTAGCAATGGGAACAGATA i 1 1 1 1 AAAGTCATTGCAATTCCATTTGGTTTTGTATCCAATCTGACTAATTCGTGGA 
60 TTGGTTTAATGATTATTTATCTATTGACTCAACTACTTTGGATTGTAGGTATCCACGGTGCGA^ 

rrrGTTAGTCCAATTGCTCTTGCTAACATGGCTGAAAATGCTGCTGGCGGGCACTTCGCTGTTGCAGGTGAA^^ 
CTAATATGTTTGTAATTGCAGGTGGTTCTGGTGCAACTTTAGGACTATGTTTATATATTGCiU^ 

GAACAGCTTAAAGCAATACGACGAGCATCTGTAGTTCCAGCCTTATTTAATATTAATGAACCATTAA^l^ I IGGAT 
TACCrATTATCTATAATCCAGCCTTGGCTATACCATTTATTTTAGCACCAATGGTTACTGCTACT^ 
65 GCGAATTCTCTAAACTTTATTAAGCCAATTATCGCACAGGTTCCATGGCCAACTCCAGTAGGGATTGGAGCTTTCT 
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TAGGGACAGCACATCTTCGAGCTGTATTAGrrGCTCTAGTATGTGCATTTGCAGC^lTCCTAGTCT^^ 
ATCCGTGTATATGATCAAAAATTGGTGAAAGAAGAGCAAGGTATCTAA 

5 ATGi^AAAATTTTATGTAAGTCCAATTTrrC 

TArrrrrGTrAATAATAATCTGrrGACGGrrrrAArrrTGrrTaT^ 

ACTGAGAGTGCATTATACAAGGAGTGATGTAGAACAGATACAGTATGTAAACCACCAAGCGGA^^ 
AGCrCTATTGGAACAGATGCCTGTAGGTGrrATGAAATTGAATTTATCrrCTGGAGAGGTTG^^ 
TATGCTGAATrGATTrrGACCAAGGAAGATGGTGATTTrGATTTAGAAGCTCTrCA^ 
10 TAGGAAATCCGTCTACTTATGCCAAGCTTGGTGAGAAGCGTTATGCTGTrCATATGGATG^^ 
GTATnTGTAGATGTATCCAGGGAACAAGCCATAACAGATGAATTGGTAACAAGTAGACCAGTC 
CTCTGTGGATAATTATGATGATTTGGAGGATGAAACTTCTGAGTCAGATATTAGTCAAATCAAT^^ 
AArmATATCAGAGTTTTCAGAAAAACACATGATGTTTTCTCGTCGGGTAAGTA 
TGACTACACGGTGCnTGAGGGCTTGATGAATGATAAATTTTCTGTTATTGATGCTTT 
15 AGACAGrrGCCCTTGACCTTAAGTATGGGArrrTCrrATGGCGATGGAAATCATGATGAGAT^^ 
TGCTCAArrrGAACTrGGCTGAAGTACGTGGTGGCGACCAGGTGGTTGTTAAGGAAAACGA^^ 
CAGTrrATTTTGGTGGTGGGTCTGCTGCTTCAATCAAGCGTACACGGACTCGTACGCGCGCrATC 
TTCAGATAAGATrCGGACTGTAGATCAGGTTrrrGTAGTCGGTCACAAAAATTTAGACATGGATGCm 
GCTGTAGGTATGCAGTTGTTCGCCAGCAATGTGATTGAAAATAGCTATGCTCTTTATGATG^ 
20 CAGATATTGAACGAGCTGTTTCATrCATAGAAAAAGAAGGAGTTACGAAGTrGTrGTCTGrrAAGGATG 
GGATGGTGACCAATCGTTCTTTGTTGATTCrrGTAGACCATTCAAAGACAGCCTrAACATTATCA^ 
TGATTTATITACCCAAACCATTGTrATTGACCACCATAGAAGGGATCAGGATTITCCAGATAATC^^^ 
TATATCGAAAGTGGTGCAAGTAGTGCCAGTGAGrrGGTAACGGAATTGATTCAGTTCCAGAATTCT^^ 
CGTTTGAGTCGTATGCAAGCAAGTGTOTGATGGCTGGTATGATGTrGGATACTAA^ 
^5 CTAGTCGGACArrrGATGTTGCTAGCTATCTCAGAACGCGCGGAAGTGATAGTATTGCTATCCAGGA^ 
GACAGAtmGAAGAATATCGTGAGGTCAATGAACTTATTTTACAGGGGCGTAAATTAGGTTCA^^ 
GCAGAGGCTAAGGACATGAAATGCTATGATACAGTTGTTATTAGTAAGGCAGCAGATGCCATGTTA^ 
GGTATTG AAGCGAG rr n G 1 1 C i' 1 GCG AAG AATACACAAGGATTTATCTCTATCTCAGCTCG AAGTCGT AGTAAAC 
TGAATGTACAACGGATTATGGAAGAGTTAGGCGGTGGAGGCCACTTTAATTTGGCAGCAGCTCAAATTAA^ 
30 TAACCTTGTCAGAAGCAGGTGAAAAACTGACAGAAATTGTATTAAATGAAATGAAGGAAAAGGAG^ 

TGA 

ATGAAAGAGAAAAATATGTGGAAAGAATTGTTGAATCGTG^^ 
35 TATATCAGGTTCCCCTAGTGGTTACCTCTATTTTGACTTTAAAAGAAGTAGCCCTGCT^ 
TGCTGGCCTITCAATrGTGGTTCTGGCTCTATTrATTATGGGAGCTCGTAAAACCA^^ 
CrmTTTAGAGCTAAAGATTrGGCACGTTTGGGCrrGAGTTATCTAGTTATrGTC^ 

ArmATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTCTCAGATTAATGATATGGTTCAAAATAG 
TGATTTCCAGTTrCTTCrrGCTAGCCnTGCrrGCrCCGArTTGTGAGGAAATCT^ 

40 AAGArrrrCCGAGGCAAGGAGAACTTGGGATTrGTAGTCCCTACGATTGTGTTTGCrTTATO 
ATrrACCTTCrrrATTGArTTATGGAGGTATGTCGACAGTTCTATCrrGGACAGCCT^ 
AATGTCGATCrrGCTTCACATGArrGTTAATGGGATTGCrrrCTGTTTGTTGGCTCT^ 
CATTAGGAATrTCTGTTTAAATGAAAGAGAAAAATATGTGGAAAGAATTGT^^ 
CTTTTTACTTGCCGTCCTTrrATATCAGGlTCCCCTAGTGGTTACCTCTATT^ 

45 TACAGTCAGGGCrGATAGTTGCTGGCCTTTCAATTGTGGTTCTGGCTCTATTTAT^^ 

GTTAGCTAGnTTAA- rrri ' iCi ' I 1 i n i AGAGCTAAAGATTTGGCACGTTTGGGCTrGAGTTATCTAGTTATTGTCG 
GGTCAAATATACrrGGTTCCATTTTATTGCAACTGTCAAATGAGACGACAACAGCTAACCAGTC^ 
TATGGTTCAAAATAGTTCGTTGATTTCCAG'rriCl ILl IGCTAGCCTTGCTTGCTCCGATTTGTGAGGAAATCTTGT 
GTCGTGGGATTGTTCCTAAAAAGATTTTCCGAGGCAAGGAGAACTTGGGATTrGTAGTCGGTACGA^ 

50 TTTATTGCATCAACCAAGTAATrrACCTTCTTTATrGATTTATGGAGGTATGTCGACAGTTCT^ 
ACAAGACCCAACGTTTGGAAATGTCGATCTrGC TTCA CATGATTGTTAATGGGArrGCTTTCT^ 
GTGGTGATTATGAGTCGGACATTAGGAATTTCTGTTTAA 

4156 4 

55 ATGGATACACAAAAGATTGAAGCGGCTGTAAAAATGATTATC GAGGCTG TAGGAGAGGACGCTAATCGCGAGGGC 
TTGCAGGAAACACCTGCTCGTGTAGCCCCTATGTATCAAGAGATrTrrrCAGGTC TrGGTC AAACAG 
ATTrGTCAAAATCCTTTG AAATTATTG ACG ATAATATGGTGGTAG AAAAGG ATATC i'i 1 1 1 CCATACCATGTGTGA 
ACACCACTTCTTGCCArrTTATGGTAGAGCGCACATTGCCTACATTCCAGATGGTCGTGTGGCAGGCTTG^ 
CTAGCCCGTACGGTTGAAGTTTATTCGAAAAAACCACAAATTCAAGAACGTTrGAATATCGAAGTGGCCGATGC^ 

60 TTGATGGACTATCTAGGTGCTAAAGGAGCCTITGTTGTCATTGAGGCGGAACATATGTGTATGAGTATGCGTGGTG 
TTAGAAAACCAGGCACTGCAACCTTGACGACAGTAGCTCGTGGTCTATTTGAAACAGATAAGGATCTCCGTGACC 
AAGCTTATCGTTTAATGGGGCTATAA 



4157.2 
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ATGAAAGACl 101 i 1 AAAGAGAAAGCAGGCCTrrCGTAAGGAGTGTCTTGGTrATCTGCGCTATGTGCrCAATG 
ACCACTrrGTCTTGTTCCTGCTrGTCCTGTTGGGCTTTCTAGCCTACCA 
GAjWiTCATTGGCCTATC Ci rriUl Vl GTAGGAATTACGTCTGTTTTA Lll ' rr AC^ 
TATGGAGGCTCCAGACAAGCTCTTTCTCTTAGTTGGAGAAGAGGAAATTAAGCTCCATCTC^^ 
5 CATTTCCCTAGT Ci - 1 rr GGCTCTTTGTACAGACC ClUnHCir GCTGTTATrrGCGCCr^ 1 i ACCAATGGGTTA 

TG GCTTG CCAG 1 1 rrr CTGCTCTATGTGCTnTATTGGGGGTAGGAAAATATTrCCA Ci 1 i i GTCAAAAGGCCAGCA 
AATTTTrC^CTGAAACTGGACTGGACTGGGACTATGTTAT^ * 
TTTCrrrGCCCTCTTTACGCAGGTCAAGGGAATTTCAAACACCGTTAACCGTCGTGCCTATCT 
AGGCTGTTCAGAAGGTGCCTGGGAAGArrrGGCAAAATCTCTATCTGCGTTCTTATCTGCGAAATCGCGACCTC^ 
10 TGCTCTCAGTCrrCGTCrrCTCrrGCTTTCCTTGCTGGCGCAGG nTrrATCGAGCAAGCTTGGATrGCGACAGCAG 

TGGTA GTTCT CTTTAACTACCTCITGCTCTTCCAGTTGCTGGCCCTCT 

CAACTCTTTCCGCTGGACAAGGGGCAAAAGGAAAAAGGCrrACAGGAGGTAGTTCGAGGATTGACCA G I Tl rOl '^ l 

TTACrTGTGGAATTAGTTGTTGGGTrGATTACCTTCCAAGAAAAACTAGCCCTTCTAGCCTTA^ 

TGGrnTACTAGTCTTGTATTTGCCTTATCAGGTAAAACGTCAGATGCAGGACTAA 

15 

4158.1 

ATGAGAAAATCAATAGTATrAGCGGCAGATAATGCCTATCTTATTCCTTTAGAGACGACTATAAAGTCT^ 
ATCAC AATAG AG ATGTTG A VViVi ATATTCTCAACAGTGATATAGCTCCTG AATGGTTTAAATTATTGGGGAGAAA 
AATGGAAGTTGTGAATTCTACAA7TCGCAGTGTACACATTGATAAAGAA CrrJl " l GAAAGCTATAAAACAGGACCT 

20 CATATAAATTATGCTTCnTACTrTAGATTTTrrGCGACAGAAGTGGTTGAATCTGATAGCCT 

CGATATCATTGTAACTGGGGAACTAGCTACTTTGTTrGAGATAGATCTCAAAGGATATTCAATrGCT 
GATGTCTATGCCTATGAAGGACGAAAATCrGGATTTAATACTGGTATGTTACTAATGGATGTTGCAAACTGGAAAG 
AACATTCTATT GTCAA TAGTrrATTGGAATTAGCGGCCGAGCAGAATCAAGTTGTTCATCTTGGGGATCAGAGTAT 
TrrAAATATTTATTrrGAGGATAATTGGCTAGCCTTAGATAAAACATATAATTATATGGTGGGTATTGA 

25 ACCTTGCrCAAGAATGTGAACGTCTAGATGACAATCCACCTACAATTGTTCACTATGCTAGTCATGATAAACC^ 
GAATACATATAGTATATCT AGACT ACGTGAATTATGCTGGGrrrATAGAGATTTGGATTGGTCAGAGATTGCTT^ 
CAACGrrCCGATTTAAATTATrrrGAAAGAAGCAATCAGTCTAAAAAAC>AGTGATGC^ 
GATATAAAACATTTAGAGTATTrAGTACAACGGTTACCTGATTGGCArmCATTTGGCTGC^ 
TGAGGAGCrGACCTCTCTATCACAGTATACGAATGTAACAGTATATCAAAATGTATTACATAGTAGAATTGATTC 

3 0 CTATTGG ACG ATTCTATAGTTTATTTACATATTAATAC AGGTGG AG AGGTTTTTAATGTAGTTACAAGGGCACAAG 

AAAGTGGCAAGAAAATCTTCGCTTTTGATATCACACGTAAAAGTATGGATGATGGACTCTATGACGGTA I ' i ' i n TC 
TGTGGAGAGACCAGATGATTTAGTGGATAGAATGAAGAATATAGAGATAGAGTAA 

4158.2 

35 ATGACTAAGATTTATTCGTCAATAGCAGTAAAAAAAGGACTATTTACCT 

GAAGTCGTATTATTCTCCCTTTTGTTGACCTAAATACTAAAGATTTTTTAGGAGGTTCAACAGCCT^ 
TCAGCraCCCTAACACCT^GAATCTAAGAAGTTTATCAATTTTTTCT 

TGATTTTATGGCAGATUfll ICri-i'inCTAAACGGTTGGGTTTAACATCTACGTCTATAGAAATACAAGATCGCCGT 
AAAATGTACCTGACCTTGCTAATTGCTGTGATTCAATCCrrGGCAGTTAGCTTGAGACTGCCACT 

40 ATTCTGCAATATTGGTTGTTCTAATGAATACAATATTGCTGATAGCAGGAACATTTriTCT^ 

rTAA ATGCGAGTATGGGGATrGGAGGTTCTATTGTAATCCrCCTATCCAGTATGGTTTTAAATATTCCrCAGGATG 
TrTTGGAAACATTTCAGACAGTACACATrCCAACAGGGATTATTGTGTTACTTGCTTTAT^ 
TATTTACTTGCCCTTATGTATCGAGCTCGCTATTTGGTTCCTGTTAATAAAATTGGOT 
CTATTCTTATCTCGAAATCATGTTGAATCCTGCAGGTGGGATGCCTTATATGTATGTGATGAGTTTrC^ 

45 CAGCTTATTrGTTCATarrGTTGGGATTTATTTTCCCT^ 

GTTGGAAAGCCTTTGTGGGTCTATGTITATATTTCGGTCTTATTTTTAm 

AATGGAGAAGAGATTGCAGACCGTATGAAAAAATCTGGAGAATACATrrATGGTATTTATCCAGGTGCGGATACT 
AGTCGATTTATTAATCGATTGGTCC^ 
^ CATGCTTTrrGTTTTGTTTGATGAAAAGTTATTACGATTGGC 

50 TGATTTTTACGATTAGAGACGAGGTCAAGGCirrAAGGCTAAATGAGACCTATAGACCT^ 
4158.3 

ATGTCCTCTCnTCGGATCAAGAj^ 

TAGACGATATTTTGGTTGAAGCTTTTGCTCTGGTCCCTGAAGCAGATAAGCGGATTTTAGGGATGm 
TGTTCAAGTCATGGGAGCrATTGTCATGCACTATGGAAATGTTGCTGAGATGAATACGGGGGAAGGTAAGACCTT 
GACAGCTACCATGCCrGTCTATTTGAACGCTTTTTCAGGAGAAGGAGTGATGGTTGTGACTCCTAATC 
TCAAAGCGTGATGCCGAGGAAATGGGTCAAGTTTATCGTTTTCTAGGATTGACCATTGGTGTACCATTTACGGAA 
ATCCAA AGAAG GAGATGAAAGCTGAAGAAAAGAAGCTTATCTATGCTTCGGATATCATCTACACAACCAAT^ 
ATTTAGGTTTTGATTATCTAAATGATAACCTAGCCTCGAATGAAGAAGGTAAGTrmACGACCGTT^ 
OU GATTATTGATGAAATrGATGATATCTTGCTrGATAGTGCACAAACTCCTCTGATTATTGCGGGTTCTCCT^^ 

AGTCTAATTACTATGCGATCATTGATACACTTCTAACAACCrrCCTCGAAGGAGAGGATTATATCrrTAAAGAGGA 
GAAAGAGGAGGTrrCCCTCACTACTAAGGGGGCCAAGTCTGCTGAGAATTTCCTAGGGATTGA 
GGAAGAGCATGCGTUi i i i GCTCGTCATTrGGTrTATGCG ATTCCaGCTCATAAGCTCTTTACTAAAGATAAGGAC 
TATATCATTCGTGGAAATGAGATGGTACTGGTTGATAAGGGAACAGGGCGTCTAATGGAAATGACTAAACTTCAA 
GGAGGTCTCCATCaGGCTATTGAAGCCAAGGAACATGTCAAATTATCTCCTGAGACGCGGGCTATGGCCTCGATC 
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ACCTATCAGAGTCTTTTTAAGATCrmAATAAGATATCrGGTATGACAGGGACAGGTAAGGTCG^^ 
TTTATTGAAACrrACAATATGrCTCTAGTACGCATTCCAACCAATCGTCCGAGACAACGGA 
TTGACTATCCAGATAATCTATATATCACTTTACCTGAAAAAGTGTATGCATCCTTGGAGTAC^^ 
TGCTAAGGGAAATCCTTTACTC Ol ' l i ' i i GTAGGCTCAGTrGAAATGTCTCAACTCTATTCGTCTCT Cl iUi i I CGTG 

5 aagggattgcccataatgtcctaaatgctaataatgcggcgcgtgaggctcagattatctccgagtcaggtcaga 
tgggggctgtgacagtccctacctctatggc\ggacgtggtacggatatcaagcttggtaaaggagtcgcagagc 
ttgggggcrrgattgrrattgggacrrgagcggatggaaagtcagcggatcgacctacaaattcgtggccgttct^ 
tcgtcagggag atcctggtatg agtaaa rit 11' i gtatccttag agg atg atgtt atc aag aaatttggtccatcr 
tgggtgcataaaaagtacaaagactatcaggtrcaagatatgactcaaccggaagtattgaaaggtcgtaaatac 
10 cggaaactagtcgaaaaggctcagcatgccagtgatagtgctggacgttcagcacgtcgtcagacrctggagtat 

GCTGAAAGTATGAATATACAACGGGATATAGTCTATAAAGAGAGAAATCGTCTAATAGATGGTTCrCGTGACTTA 

GAGGATGrrCTTGTGGATATCATTGAGAGATATACAGAAGAGGTAGCGGCTGATCACTATGCTAGTCGTGAATTAT 

TGTTTCACrrTArTmGACCAATATrAGTTTTCATGT^ 

GCAGTTCGTAG CTTTA TGAAGCAGGTGATTGATAAAGAACTTTCTGAAAAGAAAGAATTAC^ 
15 TATATGAACAGTTmACGACTTrCACrGCrrAAAGCCATTGATGACAACTGGGTAGAGCAGGTAGACT 

ACAGCTATCCATGGCTATCGGTGGTCAATCTGCTAGTCAGAAAAATCCAATCGTAGAGTACTATCAAGAAGCCTA 
CGCGGGCTTTGAAGCTATGAAAGAACAGATTCATGCGGATATGGTGCGTAATCTCCTGATGGGGCTGGTTGAGGT 
CACTCCAAAAGGTGAAATCGTGACTCATTTTCCATAA 

20 4158.4 

ATGATAGGGACrn'CGCCGCTGCTCTTGTAGCTGTACTAGCAAATTTCATCGTCCCTATTGAAA^^ 
GTGCC^ATACTGAAATTGCACCACCAGATGGGATTGGGCAGGTTCTCAGCAACCTCTTGCrCAAACT 
ACCCAGTCAACGCCCrGCTTACTGCTAACTATATTAGAATCTTATCTTGGGCACTCATTT^ 
AGAAGCCAGTAAAAATAGTCAAGAATTGCTAAAAACTATCGCTGACGTGACTTCTAAAATTGTCGAATGGATC^ 
25 CAATCTG GCTC CATTTGGAATCCTTGGTCriGll'lU' rAAAAC CATTTCTGACAAGGGAGTCGGAAG 
TACGGTATTrrATTGGTTCTArtAGTAACGACTATGCnTn'GTrGCCCCTG^^ 

TrrATGAGACGCAATCCTTACCCTCTAGTTTGGAACTGCCTCCGTGTCAGCGGTGTGACACCCTT^ 
GTTCTGCGACTAACATTCCTGTCAACATGAAACTCTGCCATGACCTTGGACTCAACCCAGATACCrrA^ 
TATCCCACTCGGTrCTACTATCAATATGGCTGGAGTAGCGATTACCATTAACCTTTTCACCCTTGCrGCAGTTAAC 
30 ACTCTrGGAATrCCTGTTGACTTTGCCACAGCCTTTGTCCTCAGTGTGGTAGCAGCTATCTC^^ 
AGGTATTGCCGGAGG TTCCC TCCITCTTATCCCAGTTGCTTGTAGCCTTTTCGGTATI^ 
AAATTCTTGGGGTTGGTTTrGTGATTGGTGTC 
CCTCrrTACCGCCGTTGCCGAATACGCAGCAACCCGTAAAAAATAA 

35 4158.5 

ATGTCTATTAGCCAACGTACGACCAAGCTCATCTTAGCTACCTGTCTTCCCTGCCTGCTrGm 

TTCGTCAGCAGTITCGGCTGGAATTATCGCTCTCTTGAGCCTATCT 

CGCAATCGTCTTTTTTCTATGCTTCTAGCTCTGGCTATCGGTGT^ 

GA GTCTCG GCCTCTATCrGGCCTTCrACGTTCCTTTAGCCTACAAGATGGGCTGGGAAATTGG 
40 ACTUi I I iGGTTAGCCATCTCTTGGTTCAACAGTCAACCTCTCCAGACCTTCTAGTCAATGAATTCCT^ 

TATTGGTACAGGATTTGCCrrGCTTGTTAATCTCTATATGCCTTCACGAGAAGAGGAAATCCAGCACT 
CTGGTGGAAGAAAAGTTAAAAGATATCCTCCAGCGCrTCAAATACrATTTATCCAGAGGAGACGGACGCAACCGA 
G CACA GCTGGTAGCAGAATTAGACACGCTITTGAAAGAAGCCCrCAGACTGGTCrATTTGGATCACTCT 
TCTTTCACCAGACAGACrACCATATCCACTACTTTGAGATGAGACAGCGACAAAGTCGTATCCTGAGAAACATGG 
45 CCCAACAGATTAACACTTGTCACCrrGCCGCCAGTGAAAGCCTGATCrTAGCGCAACTCTTTTCA^ 
TCAACTGAGCCAGACCAATCCTGCTTCTGATTTGCTAGATGAAATTGAACGTTATCTGGAAGTCT^ 
AGTCTGCCCAAGACAAGAGAAGAATTTGAAACCCGCGCCACCCTTCTTCAACTCCTACGTGAAGCCAAAACCT^ 
ATCCAAGTAAAAGTTGATmTACCAAAAATATAGACAGTAA 

50 4158.6 

atggaaatcatgtcgcttgcgattgcn*gtttttgccgtcatcatrggtn*agtcattggatatgtcagcatct 
taagatgaaatcatctcaggaagctgcagagttgatgcttttaaatgctgaacaagaagcaactaattt 
acaagctgagcgtgaagccgatrtacttgttaatgaagccaaacgtcaaagcaagtctcttaaaaaagaagcact 
attggaggccaaagaagaagccagaaaataccgtgaagaagtggacgctgaattcaaatcagaacgtcaagaac 

dd tcaaacaaatcgaaagtcgtngacagagagagctactagccitgaccgtaaggacgacaaritgacgagta^ 
aacaaacacttgaacaaaaagaacaaagtattrctgatagagcgaaaaaccttgatgcgcgtgaagagca^ 
aggaagtcgaaagacaaaaagaagcagaactagagcgtattggtgcgctgtctcaggcagaagcacgagatatt 
atcttggctcagacagaggaaaacttgaccagggagattgccagtcgcattcgcgaagctgagcaagaggtcaag 
gaacgtrctgacaaaatggccaaggacatcttggttcaagctatgcaacgtatcgctggtgaatatgtagcggag 

ou tcaacaaactca acagt tcatctgccagacgatactatgaagggacgcattattggtcgtgaaggtcgtaacatt 
cgtaccmgaaagtttgacaggggtcgatgtgattatcgacgatacaccagaagtggtgaccrrgtcac^ 

ATCCGATTCGTCGTGAGATTGCCCGTATGACTATGGAAATGTTGCTCAAAGATGGTCGTATACATCCAGCTCGTAT 
CGAAGAGTTGGTTGAGAAAAACCGTCAAGAGATTGACAATAAGATTCGTGAATACGGTGAGGCrCCTCCCTATCA 
AATTGGT^CGCCAAACCTTCATCCAGACn'GATGAAGATTATGGGACGTTTGCAGTTCra 
b:> AATGrmGCGCCATrCGATTGAGGTTGCTAAGTTGGCTGGTATCATGGCGACCGAACTTGGTGAAAATGCGGCT^ 
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TTGCCCGTCCrrGCTGGATTCCrTCACGATATCGGGAAAGCCArrGACCATGAGGTTGAAGGTAGCCAC^ 
CGGTATGGAATTGGCCCGTAACTACAAGGAACCCCCAGTTGTGG7GAATACGATTGCTAGTCACCACGGAGATGT 
TGAAGCrGAGAGCGTGATAGCAGTTATCGTCGCTGCAGCAGATGCCTTGAGCGCAGCCCGTCCAGGTGCrCGTAG 
TGAGTCTCrrGAAAGCn-ACATCAAGCGTCTCCATGATTTGGAAGAAArrGCTAACGGCm 
5 AGOrrrGCCCrrCAAGCAGGACGTGAAATTCGTATCATGGTCAATCCAGGAAAAATCAAGGAC 

ATCTTGGCTCACAAAGTTCGTAAGAAAATTGAAAACAATCTCGArrATCCAGGAAATATCAAGGTAACCGTGAT^ 
CGCGAGCTTCGTGCAGTAGATTATGCTAAATAA 

4158.7 

1 0 atg atgttaaaaccctctattgatacctrgctcgacaaggttccrrcaaaatattcactcgtaatc^ 
aacgtgcccacgaattggaagcaggtgccccagcaactcaaggtttcaagtctgaaaaatca^ 
tagaagaaatcgaatcaggaaacgttacaattcacccacatccagaaggaaaacgtgaagcagtgcgtcgccgta 
tcgaagaagaaaaacgccgcaaagaagaagaagaaaagaaaatcaaagagcaaattgctaaagaaaa^ 
tggtgaaaaaatttaa 

15 

4161.1 

ATGTCAGCATATCAATTACCGACCGTATGGCAGGATGAAGCTAGTAATCAAGGAG CnTI 'ACGGGGCTAAACAGA 
CCAACAG CAGGTGCCCGTTTCGAACAAAACTTGCCAAAAGG AG AACAAG C in i CAGCTTTATrCACTGGGAACA 
CCAAATGGTGTGAAGGTTACTATCTTATTGGAAGAATTACTAGAAGCTGCTTTTAAGGAAGCGGOT 

20 ATAAGATTGCTATCATGGATGGGGATCAATTCGGATCAGACTTTGTGAAGCTCAATCCAAATTCCAAGATTC^^ 
CirATTGGACCAGTCAGGTACTGAAAACGTAAGAGTaTrGAGTCTGCTCATATrCTTCm 
TTTGGAGCCn 1 ACCAAGTAATCCTGTGGAAAAGGTAGAAGTTITGAATTGGCrATTCTGGCAAGCACGTCCAG 
CACCTTTTCTAGGTGGGGGATTrGGACATTTCTTCAATTATCCTCCTGAAAAATTGGAATATCCTA^ 
ACGATGGAAGTGAAACGCCAGTTGGATTTATTGGATAAGGAATTGGCTCAGAAACCTTATATTGCAGGCAATGAC 

25 TATACGAXTGCAGATATTGCTATCTGGTCTTGGTATGGACAGTTAGTTCAAGGAAATCnTTACCAAGGr^ 
AATTCiTGGATGCCTCAAGTTATCAAAATCrAGTAAAATGGGCAGAAAAAATTGCCAATCGTCCAGCT 
GTGGCTTCGAAGTAACTTATACAGAAATTAAATAG 

4161.2 

30 ttggcaagcttgatcacttctatcatcatgtrctatctcggtttccatgttctaagagataccattc/^^ 
cactcgggaagaaacggtcattgatcctcttggtgcaactctaggaatcatttctgcagcgattatc 
tatctctacaatactcgccrcagtaagaaatccaactccaatgcgctgaaggcagctgctaaggacaatcttct 
acgctgttacctcacttcgaaccgccattgcca 
tgctatcatcatcactttctttatcrtcaacactcc 

35 gctttgacgaccgcctgctcgaggactaccaaaaggctatcatggaaattcccaaaatcagcaaggtcaaa^^ 
aaagaggtcgcacctacggtagcaac^tctacctggatattacactagagatgaatcctgacttgtctgttt^ 

AAGCCATGAAATCGCGGATCAGGTCGAGTCTATGCTGGAGGAGCGTTTTGGCGTCTTTGATACCGATGTCCATATC 
GAACCAGCACCrATCCCTGAGGATGAAATITrAGACAATGTCTATAAAAAATTGCTTATGCGTGAACAAT^ 
ACCAAGGAAACCAACTAGAAGAACrCrrGACTGATGATTTTGTCTATATTCGCCAAGATGGAGAGCAGATGGATA 
40 AAGAGGCTTATAAGACCAAAAAAGAGTTAAATTCTGCTATCAAGGACATTCAAATTAC^ 

CCAA ACTCATCTGCTATGAGTTAGATGCTATCATCCATACCAGTATCTGGCGTCGCCACGAAACCrGGCAAAATAT 
CTTTCATCAAGAAACCAAAAAAGAATAG 

4162.1 

45 ATGACAATTAAACTAGTAGCAACGCATATGGACGGAACCTTCCTAGATGGGAATGGACGCrTTGATATGGATCGT 
CTCAAGTCTCTCXrGGTTTCCTACAAGGAAAAAGGGATTTACnTTGCGGTAGCTTCGG 
TAGAAAAATTATTTGCrGGTGTTCGTGATGACATTATTTTCATCGCGGAAAATGGCAGr^ 

TCAGGACTrGTATGAAGCGACTATGTCTCCTGACri'l'l ATCTGGCAA Cl i 11 GAAAAGCTG AAAACTTCACCTTAT 

gtagatatcaataaactgctcttgacgggtaagaagggttcatatgttctagatacggttgatgagaccratttga 
50 aagtgagtcagcactataatgaaaatatccaaaaagtagcgagtttggaagatatcacagatgacattttcaaat 

TTACAACCAACnrCACAGAAGAAACGCTGGAAGATGGGGACCCTTGGGTAAACGAj^^ 
CCATGACAACTGGCTTTGAATCCATTGATATrGTTCTOTACrATGTCGATAA 

TAAAAAACrtGGTATCACAATGGATCAGGTCATGGCTTTTGGAGACAATCTTAATGACrrACATATGATG^^ 
GTGGGACATCCTGTAGCTCCTGAAAATGCACGACCTGAAATTTTAGAATTAGCAAAGACTGTGATTGGTCACCATA 
55 AGGAACGGTCGGrrATACCTTATATCGAGGGCTTATAA 

4162.2 

ATGGCAGATATAAAATTGArrGCATTGGACTTGGACGGGACCITGCTGACTACTGATAAAAGGCTGACG^ 
ACCAAGGAAACCrrGCAAGCTGCGCGTGATCGTGGTATCAAGGTC^ 

60 atggatttctttctccatgagttagggactgacggtcaggaagatgagtataccattacrmaatggtggatta 
ttcagaaaaatacaggagaaatccttgataaaacagtcttttcatatgatgatgtgccacctttgtat^ 
agagaaattatcactgcctc ttga tgccatctcagaaggaacagtttatcaaatccaatcggaccaagaaagtct 
ttatgccaaattcaivrccagctttgacctttcttccagtggacn^ 
aaatccgtgactgcctttgctcaagaacccttggatgcagcca 

65 atgaaatctttaaatcacgtgaaatgttgctagaatggtcaccaaagaatgttcataaagcaacaggtt^ 
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AACTAATCAGCC>TCTTGGAATCGACCAAAGTCAAGTGATGGCTTGTGCrrGACGAGGCCAATGACCTCrCT^ 

TTGAATGGGCAGGTCTTGGTGTTGCTATGCAAAACGCTGTTCCTGAAGTAAAGGCAGCCGCAAATGTAGTGACG 

CGATGACCAACGATGAGGAAGCTGTCGCCTGGGCTATCGAAGAATATGTGCTAAAGGACAACTAA 

5 4164.2 

ATGGAAAGTTTACTTATrCTATrATTAATrGCCAATCrrAGCTGGTCTCTTrCTGAm 
AGGAGAAACACTTAAGTAAGAGCTTGGAGGATCAGGCAGATCATTTGTCAGACCAGTTGGATTACCCCnT^ 
AAGCCAGACAAGCCAGCCAGTTAGACCAAAAAGATTTGGAAGTGGTTGTCAGCGACCGTrrGCAAGAAGTGCGGA 
TTGAATTGCACCAAGGTCTGACCCAAGTCCGTCAAGAAATGACAGATAATCTCCn'CCAAACTAGAGACAAGA 

10 ACCAACGTCTCCAAGCCnTGCAGGAATCAAATGAGCAACGTTTGGAACAAATGCGCCAGACGGTCGAGGA^^^ 
TAGAAAAGACCTTGCAGACACGCTTACAGGCTTCCTTTGAGACAGTTTCTAAAC^ 
CCrrGGAGAAATGCAGACAGTTGCCCGTGATGTCGGAGCTCTTAACAAGGTTa'CrCTGGAA 
GATTCTGGGAGAATrGCAACTGGGGCAAATTATTGAAGACATCATGACACCTGCCCAGTACGAACGAGAATACGC 
AACGGTTGAAAACTCTAGTGAACGAGTGGAGTATGCCATCAAGTTACCCGGACAAGGCGACCAAGAATACGTCTA 

15 TCrGCCAATTGACTCTAAGTrrCCACTGGCAGATTATTACCGCTTGGAAGAAGCCTATGAGACAGG^^ 

GAGATTGAACGCrGTCGTAAGTCACTCCTAGCAAGCGTCAAGCGCTTTGCTAGGGATATrAGGAACAAGTACATA 

GCACCACCTCGGACGACCAATTTTGGAGTrTTGTrrGTTCCGACAGAAGGTCTCTACrCAGAA^ 

CGGT Cl ' lC ' i 1 r GATGATTTGAGACXiGGAAGAACAGATTATTGTTGCAGGACCAAGTACCCTATCAGCCCiU'CTrAA 

CTCCCTATCAGTTCGTTTCAAGACCCrrAATATCCAAAAGAGTGCCGACCATATCAGCAAGACTCTTGCC^ 

20 AAGACCCAGTrTGGCAAGTTTGGTGGTATTCTGGTCAAGGCACAA^iLA.CATCT^ 

ATGAArTATTAAACCGTCGTACCATAGCTATCGAGCGGACGCTCCGTCACATTGAGTTGTCAGAAGGTGAGCCTGC 
GCTTGATCTACTCCATTTTCAAGAAAATGAGGAAGAATATGAAGATTAG 

4164.3 

25 ATGAAGATTAGTCACATGAAAAAAGATGAGTrATrrGAAGGCTTTTACCTAATCAAATCAGCTGACC^^ 
ACTCGAGCn'GGGAAAAACTACCTAGCCTrrACCTTCCAAGATGATAGTGGCGAGATTGATGGGAAGCT 
GCCCAACCTCATAACATTGAGGCCTTTACCGCAGGTAAGGTTGTCCACATGAAAGGACGCCGAGAACTTTATAAC 
AATACCCCTCAAGTCAATCAAArrACTCTCCGCCTGCCTCAAGCTGGTGAACCCAATGACCCAGCTGATTTCAAG 
TCAAGTCACCAGrrGATGTCAACGAAATrCGTGACTACATGTCGCAAATGATTTTCAAAATTGAAAATCCTGTCT 

30 GCAACGGATTGTCCGAAATCTCTACACCAAGTATGATAAGGAATTCTACTCCTATCCAGCTGCCAAGACCAACCA 
CCATGCCTTTGAAACGGGCrrGGCCTATCATACGGCGACCATGGTGCGTTTGGCAGACGCTATTAGCGAACTT^ 
CCTCAGCTCAATAAGAGCCTGCTCTATGCGGGGATTATGTTGCATGACTTAGCTAAGGTCATCGAGTTGACGGGGC 
CAGACCAGACAGAGTACACAGTGCGAGGTAATCTTCnTGGACATATCGCTCrCATTGATAGCGAAATTACCAAGA 
CAGTTATGGAACTCGGCATCGATGATACCAAGGAAGAAGTCGTTTTGCTTCGTCATGTCATCCTCAGTCACCACGG 

35 CTTGCrTGAGTATGGAAGCCCAGTCCGTCCA CGCA TTATGGAAGCAGAGATTATCCATATGATTGACAATCTGGAT 
GCAAGCATGATGATGATGTCAACAGCTCTTGCrrTGGTGGATAAAGGAGAGATGACCAATAAAATCTrCCCT 
G ATAATCGTTCCTTCTATAAACCAG ATTTAG ATTAA 

4166.2 

40 atgagtgaaaaagctaaaaaagggtttaagatgccntcatcttacaccgtattattgataatcattgct 

cagtgctaacrrggtttatccctgcgggggcctttatagaaggtatttacgagactcagcctcaaaatcc^^ 
gatttgggatgtcctcatggcaccga7tcgggctatgctaggtactcatccagaggaaggttcgctcattaaagaa 
acgagcgcagcgattgatctagccttcttcatccttatggttggtggtttccttggcattgtcaacaa^ 
ctcttgacgtagggattgcctctatcgtgaagaagtataagggccgcgaaaaaatgttaattttggtactgatg 

45 ttrgtttgccctcggtggtacaacntatggtatgggtgaagaaacaatggccttcratccact^ 

atgatggccgttggttttgatagcctgactggtgttgcaattattttgctcggttctcaaatcggctgm 
tactctgaatccatttgcgacaggtattgcntcagcgactgcgggagttggtacaggggacggtatcgtacttcgt 
ctgatcttctgggttacctrgactgctcrragtacttggtttgtttaccgttatgccgataa^^ 
gactaagtcactggttratagtactcgcaaagaagatttgaaacactrtaacgtagaagaatct^ 

50 tctacachtagcagcaaacaaaaatcagtictct^ 

tccatggacagaccttggcgttaccatrrttgatgactttaatacttggttgactggtcttccag^ 
ttgtcggttcatctacttctgcactaggtacttgctacrrcccagaaggcgcaatgcrctttgcc^^ 
ctgarrggtgttatttatggtcttaaagaagataagattatctcttccttcatgaatggtgctgctgac^ 
tgttgccrrgatcgtagcgattgctcgtggtattcaagttatcatgaacgacggtatgattaccgatacaatcctc 

53 aactggggtaaagaaggcrrgagcggtctatcttcacaagtctttatcgttgtaacttatatc^ 
gtcattcrtgatcccatcttcatcrggtcttgccagcccaactatgggtatcatgcc 
tccacttggagaatttgtaaatgtccgtcctagcttgattatcactgcttaccaatctgcttcaggtot 
tcattgcaccaacatctggtattgtgatgggagcrcttgcacttggacgtatcaacattggtacttggtggy^ 
catgggcaaactcgtagtcgctattattgtagtgaccatcgcc c 'i 1 l ' n 'ctccttggaaccrrccttccattcctat 

60 AA 

4166.3 

ATGAAAATAGATATAACAAATCAAGTTAAAGATGAATrrCrTATATCATTAAAAAC 

TACTCAATGAAGGAGAAAATGGAACACCTTrrGGACAAGCAATCCAAGATGTCCTAGAAAAAACTTTAGAG 
65 GTCGAGACATAGGTTTCACTACCTATCTTGACCCTAAAGGTTATTACGGATATGCAGAAATCCGTCAGGGAGCAG 
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AGCrrCTGGCCATTCTCTGTCArrTGGATG I I'GT TCCATCAGGTGATG AAGCAGATTGGCAGACACCGCCATrTGA 
AGCAACTATCAAAGaCGGCTGGGTATTCGGACGTGGTGTCCAAGATGAT.\AAGGCCCTTCGCTCGCAGCTCTC^^ 

tgcagtaaaaagcttgctggaccaaggtattcagttcaaaaagcgcgtacgctttatct^ 
aaccctctggcgctgcatggcacgctacaataccatcgaagaacaggccactatgggctttgcacctgactcatc 

5 TTTTCCTCTGACCTATGCTGAAAAACGGCrrTCTACAGGTCAAACTTCATCGCCCTGGATCGGATCAACTA 
GAAGTAGGAGGCGCCTTTAACGTTGTACCAGACAAGGCCAACTACCAAGCTCTCCTCTATGAACAGGTT^ 
GGTCTCAAAGAAGCTGGTTATGATTACCAAACCACTGAACAAACCGTAACGGTTCTCGGAGTCCC^ 
AAGGATGCTAGTCAAGGTATCAATGCTCTCATCCGACTAGCTACCATTCTTGCTCCrCTCCAAGAACACCCT 
TCAGlU'i ICT'rGCAACACAAGCAGGTCAAGACGGCACAGGAAGACAAATCmGGTGATATAGCAGATGAACCTT 

10 CTGGTCACCTATCCTrTAATGTCGCAGGTCTCATGATCAATCATGAACGTTCTGAAATCCGTATTGACATTCGGAC 
TCCTGTCTTAGCTGACAAGGAAGAACTAGTAGAGTTGCTTACAAGATGTGCACAAAACTACCAACTCCGCT 
AGAGTTTGACTATCTAGCGCCTCTATACGTCGCAGAAGACAGTAAACTCGTTAGCACACTGATGCAAATCTACCA 
AGAAAAGACTGGCGATAACAGTCCTGCTATTTCATCCGGTGGTGCCA Ci i 1 1 GCTCGC ACCATGCCAAATTGTGTA 
GCCTTCGGCGCCTTATTCCCAGGAGCGAAGCAGACAGAACATCAGGCAAATGAATGTGCCGTTCTAGAAGATTTG 

1 5 TACCGTGCTATGGATATTTATGCCGAAGCCGTCTATCGACTTGCAACTTAA 

4169.1 

ATGTCTAATTCATTTGTCAAGrrGTTAGTCTCTCAATTATTTGCAAATTTAGCAGAT AI ' r 
CATTGCTAACA TATAC ATTATTTCAAAATCAGTAATTGCCACATCACTAGTTCCTATCTT^ 

20 TTGTTGCGAGT Li 1 1 1 AGTrCCGTTGGTrACTAAAAGGTTAGCGCTAAATAGG Gin - l ' lA TCTTTATCTCAATTTGGA 

AAGACTATATTATTGGCGATACTGGTAGGAATGTrrACCGTAATGCAATCCGTAGCGCCTTrGGTGACCrATC^^ 
TTGTTGTTGCAATTTCCATACTAGATGO i'i GCAGCACCCGTTTCCTATGCTATTGTGCCACGCTATGCG 
TTGGGTAAGGCTAATTCAGCCrrrATCAATGACTGGTGAAGCrGTTCAATTGATAGGTTGGGGATT^ 
TGTITGCAACAATTGGTCTGTrACCTACCACGTGTATCAArrTAGTCnTGTATATCATTTC^^ 

25 TTTCTTCCTAACGCTGAAGTGGAGGTGTTAGAGTCAGAAACTAATCTTGAAATTrrGCT 

TTGCTAG'AAATC CTAGAT TAAGA Ci n ' i ' i 'GTATCAGCAAATITATTGGAA Al n - fn CAAATACGATTrGGGTrTCT 
TCCATTATACTTGTTTTTGTAACGGAGTTATTAAATAAAACGGAAAGTrACTGGGGATATT 
CTATTGGTATTATAATrAGTGGCTTAATTGCrmAGGCTATCTGAAAAGTrCCTTGCTGCTA 
ATTATTC ACCCCA AATCTAAAAACCATCCAGAAT CCTTG CCTTAGCTTAGATCCTGGATGG rri Cri ' l ' lTl CACCCA 

30 ATGGGTGTTTTTTACT AGAC AAAAAAGAGTTTCCCCTTTATGCTATAAGTGTAGAAAAAAACACA 

AAACTCAC ATG AACAGTTTACCAAATCATCACTTCCAAAACAAGTL: i ■nTrACCAACTATCTTTCGATGG AGGTCA 
TTTAACCCAGTATGGTGGTCTTATCl rri-i ICAGGAACl I i ii ICCCAGTTGAAACTAAAAGAGCGGATTTCTAAGT 
ATTTAGTAACGAATGACCAACGCCGCrACTGTCGTTATrCGGATTCAGATATCCrrGTCCAGTTCCTCT^ 
TTAACAGGTTATGGAACGGACTATGCrTGTAAAGAATTGTCAGCTGATGCCTACrT 

35 TCCAAAATTGTTGGAAGGAGGGCAGCrrGCTTCACAGCCAACCTTATCCCG 

ACAGTCCATAGTTTGCGATGCCTCAACCrrGAATTGGTCGAATT C Vi ' iTr ACAGTTTCACCAGCTAAACCAACTCA 
TTGTAGATATCGATTCTACCCATTTCACAACTTATGGCAAGCAAGAAGGTGTTGCTTATAACGCCCACTA 
TCATGGCTATCATCCTCTTTATGCITTCGAGGGGAA.GACAGGTTATTGTTTCAATGCCCAGC^ 
GTTATT GTrCT GAAGAGGCAGACAGCnTrATCACACCTGTTITAGAACGGTTTAATC^ 

40 TAGTGGCTTTGCGACCCCAAAATTATACGATTTAATTGAAAAAACAGGGCAATACTACCrCATAAAACTCi^ 
AAATACTGTTCTGAGCCGTCTTGGAGACCTTTCCCTCCCTrGCCCACAGGATGAGGACTTAACCATCTrGCCC 
TCCGCCTACTCAGAAACTCTCTATCAAGCAGGATCrrGGTCGCACAAGCGTCGTGTCTGCCAGTTCrCTGAACGAA 
AAGAAGGAAACTTGTTCTACGATGTTATTTCTCT 

GCTTTATCGTGGACGTGGTCAAGCCGAGAATrrCATCAAGGAGATGAAGGAGGGATTTTTTGGCGATAAAACGGA 
45 TAGTTCAACCTTAATCAAAAACGAAGTCGTATGATGATGAGCrGTATCGCCTACAATCTCT 
CATCTAGCTGGAGGTGACTTCCAAACTTTAACAATCAAACGCTTCCGCCATCTTTl^ 
GTGTTCGAACAGGACGCAAGCAGCTCCTCAAATTGTCTAGTCTCTATGCCTATrCCGAATTGTTl^ 
TTCTAGGATTAGAAAAGTCAACCTGAATCTTCCTGTTCCTTATGAACCACCTAGAAGAAAAGCGTCGTTAATC 
CATTAA 

50 

4169.3 

atgatggauu 1 1 i i c aacagctt cctca tttagagccatatggcaatcctcagta i ' rrigi n atgigattgctgc 
aaccttgcccatctttataggtctctttttcaagaaacgctttccctggtatg^ 
ttgtcaccatgttggtgggtcgaaagaccaatcaactagctgccttgggtatt^ 
55 cctgcrrrrctacaagcattatcgaaaaagcaaggatggcaaot 
cttccgattatctttgtcaaggtgcaacc^gctatcaatggaa 

cctgacctttcgrrcggttggaattgtcatcgagctgagagatggagtgattaaggattttaccctct^ 

ctccgtttccttctcntcatgccaacttrctcgagtggtccaatcgatcgctrraagcgat^ 

ggctattcctgagcgaga tgagt rgatggatatgctggatgaatctgtccgctatatcatgtggggctitngtat 

aagtttatcctagctcatgttttaggagagaccttactacctcctct 

tgtttaatctctatgccttggcagttatgtatacttrtggtctggaactct^ 

TTGCrrrGGCCATCTCAAACTTGATGGGAATCCGTAGCCCTATCAACTrrAACAAGC 
AAAGGAGTTTrGGAATCGCTGGCATATGAGTCTGTCCTTCTGGTTCCGTGACTT^ 

GTGTT AACC AGAAAGAAAGTCITTAAAAATCGTAATGTAACCTCAAGCATGGCCTACATTGTAAATATC 
DO TGGGATTTTGGCATGGTGTGACCTGGTACTATATCGCCTATGGACTCTTTCATGGACTAGGCTTGCTC 
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TGCCTGGGTrCGCAAGAAAAAAACGCTCAATAAGCAACGGAAAAAAGCAGGGAAGGCTCCOT^ 

GCTGGATTCAGrrCCTrGGCATGGTrGTCACTrrCCATGTTGTCATGTTGTCArrCTr^^ 

ATAATCTATGGTTTAAAAAATAA 

5 4169 4 

ATGCTTAAACGCTTATGGATGATCTTCGGACCGGTCTTGATCGCTGGTTTGTTGCT^^ 

CCrACTGAGATGCATCATAATCTAGGAGCTGAAAAGCGTrCAGCAGTGGCTACTACTATCGATAGTm^^ 

GAAGTCAAAAAGTCAGAGCACTATCTGATCCAAATGTGCGTrTTCTTCCCTTCT^ 

TrrTGACGGTGCTCATCCTGCGGTATTAGCTGAGAAATACAATCGrrCCTACCGTCCrTATCT^ 

10 ggagctgcatcgcttaaccaatattttggaatgcaacagatgttaccacagctggagaat^^ 

GTTATCTCACCTCAGTGGTTCAGTAAAAATGGCTATGATCCAGCAGCCTTCCAGCAGTATmAAT^ 

TGACTAGTTTrCTGAAACATCAATCTGGGGATCAGGCTAGTCAATATGCAGCGACTCGCTTACT^ 

AAACGTAGCTATGAAGGACCTGGTrCAGAAGTTGGCAAGTAAAGAAGAATTGTCGACAGCAGACAATGAAATGAT 

TGAATTATTGGCrCGTTTTAATGAACGCCAAGCTrCCTTTTTrGGTCAGT^ 
1 5 ATAAGCATGT AGCTAAGTATTTAAAAATCTTGCCAG ACCAGrrrrCTTATCACGCAATAG AAG ATGTTCT 

AGATGCTGAAAAAAATACTTCCAATAATGAGATGGGAATGGAAAATTATTTCTATAATGAGCAGATCA^ 
TTTGAAGAAATrAAAGGATTCTCAGAAAAGCTITACCTATCTCAAGTCGCCAGAGTATAAT^ 

ttaacacagttttctaaatctaaggtaaacccgatttttatcattccacctgttaata;^^ 

CTGGTCTACGAGAGGATATGTACCAACAAACGGTGCAGAAGATrCGCTACCAGTTAGAAAGTCAAGGTmACCA 
20 ATATAGCAGATTTTTCTAAGG ACGGCGGGG AGCCr ricr I ' I ATG AAGGACACCATTCACCTTGGTTGGTTGGGTTG 

GrrGGCTrrrGACAAGGCAGTTGATCCrrTCCTATCCAATCCCACACCAGCTCCGACTrAC^^ 
TTTTTCAGCAAAGATTGGGCGACTTATGATGGAGATGTCAAAGAATrrCAATAG 

25 IrGGAGAAAAACCTCAAGGCTTrGAAACAAACAACAGACCAAGAAGG 

GGATACCAAGACAGTCCAAAATGGTTACrTCGAGGATGCAGCTGTCAAGGACCGCACCTTGAGTGACTATCCAGG 
TAACTGGCAATCAGrrTATCCTTTCCTTGAAGACGGCACGTTTGACCAAGTCTTTGACT^ 

GGTAAGATGACCCAGGCTGAGTACAAGGCTTACTATACAAAAGGCTATCATACAGATGTGACTAAGATTAACAT^ 
ACTGATAATACTATGGAATTTGTTCAAGGTGGACAAAGCAAGAAATACACTTACAAGTATGTCGGTA^^ 
30 TTGACTTACAAGAAAGGCAATCGTGGCGTGCGTTTCCTCTTTGAAGCCACAGATGCTGATO^ 

ATGTTCACTTTAGTGACCACAATGTTGCCCCAGTTAAGGCAGAACATITCCATATCTTCriTGGAGGCAC^ 

AGAAGCCCTCTTTGAAGAAATGGACAACTGGCCAACCTACTACCCAGATAACCTATCTGGCCAAGAAA 

AGAAATGTTGGCGCATTGA 

^ ATGAAAGATGGTCATrrGCTAGCCCATCATATrCGT^ 

ATCCTGAGGCTCrrTATAGGGGTGAACAGGGCAAGATTTTAGCGGTTrTATGGAATAGTGAAACT^ 
CTGCGACAGATATCGCGCTTGCGACTGGACTTGCGAATAATACGCTGACGACTA TGAT AAAAAAGCTAGAGGAAC 
AAAAGCTTGTAATTGTTAGTCCGTGTGGAAAAGACAAGCGTAAGAA GTATT TAGTTTTA AC^ 
40 CCCAGAAAGAAGTGGGGCATCCTGTCAGTCAGAAATTGGATACTATCTTTTACAAAGGATT^ 
TTCACCAATTTGAAGGTTTTCAAGAAAGAATTTTGGCGAATCTGAAAGAGAAG^^ 

4170.4 

ATGACTAATTTAATTGCAACTTITCAGGATCGTTTTAGTGATTGGT TGACAG CT 

45 GCTrrTGACCTTGTTACTAGCTATTTTGCTrGCGATTCCCnTGGCTGin-^ 

ACTGGGTCTTGCAGATrGCAGGTATTTTCCAGACCATCCCGTCTCTGGCCTTGTTGG^ 
GGAATTGGGACCTTGCCGGCTTTGACAGCTCTAGTGATTTATGCGATriTCCCTATrrrGCAA^ 
GGCrrGAAGGGAATTGATCCGAACCTG(>AGAGGCTGGGATTGCCTTTGGGATGACCAGATGGGAACGTCTCAAGA 
AATrTGAAATTCCACTCGCCATGCCTGTTATCATGTCTGGGATTCGGACGGCAGCTGTTTTGATTATCGCT 

50 AACCTTGGCGGCCTTGATTGGTGCAGGGGGACTAGGTTCCTTT ATTCn TrrGGGAATTG 

TTG ATTTTG ATTGGGGCA Cr 1 " i ' C 1 ' i CTGCAGTGCT AGCCATTGCCTTTAACTTCCT ACT AAAAGTG ATGG A AAAAG 
CAAAATTACGGACGATTTTCTCAGGTTTTGCCTTGGTGGCTTTATTACTGCGTCTGTC^ 
GTTCAAAAAGAGAAGGAAAACTTGGTTATTGCTGGGAAAATAGGTCCAGAACCAGAAATTTTGGC^ 
AAGTTGCTGATTGAAGAAAATACCAGCATGACTGCGACTGTTAAACCGAATmGGGAAGACAAGCTTCCT^ 

55 AAGCTCTGAAAAAAGGCGATATTGACATCTATCCTGAATTTACTGGTACGGTGACTGAAAGTTTGC^ 

ACCCAAGGTGAGTCATGAACCAGAACAGGTrTATCAGGTGGCGCGTGATGGCATTGCTAAGCAGGATCATCTAGC 
CTATCTCAAACCCATGTCTTATCAAAACACCTATGCTCTAGCTGTTCCGAAAAAGATTGCTCAAGA^ 
AAGACCATTTCAGACTTGAAAAAAGTGGAAGGGCAGTTGAACGCAGGTTTTACACTCGAGTTTAA^^ 
GATGGAAATAAGGGCTTGCAATCAATGTATGGTCTCAATCTCAATGTAGCGACCATTGAGCCAGCCCrrCGCTATC 

60 AGGCTATTCAGTCAGGGGATATTCAAATCACGGATGCCTATTCGACTGATGCGGAATTGGAGCGTTATGATTTACA 
GGTCTTGGAAGATGACAAGCAACTCTTCCCACCTTATCAAGGGGCTCCACTCATGA^ 

ACACCCAGAGTrGGAAAGAGTTCTTAATACATTGGCTGGTAAGATTACAGAAAGCCAGATGAGCCAGCTCAACTA 
CCAAGTCGGTGTTGAAGGCAAGTCAGCAAAGCAAGTAGCCAAGGAGTTTCTCCAAGAACAAGGTrTGTTGAAGAA 
ATGA 

65 
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4170.5 

ATGATGCATACTTATTTGCAAAAGAAAATTGAAAATATCAAAACAACCCTAGGTGA-AATGTCAGCT 
CGTATGGTTGCGGCTATGGCTGATTTAGGATTTTCAGGAACTATGAAGGCTATCTGGCATGACCTC^ 
GTA Gl i i rG CCCAGTGGATTrATTrGCTG Gi ' i 1 i AGGAA Grn TCCTCTCTGGCTGGAGTTGGTTTACGAACATCGT 
5 ATTGTTGACTGGATrGGGATGATTTGTAGCTTGACAGGGATTATCTGTGTAATCTTTGTATCGGAAGCT^ 
GTAATTATCTTTTTGGCrrGATTAACTCTGTTATTTACCrrAT^ 

CTGACGACACrrTACTTCACAGTCATGCAGCCAATrGGACTrCTAGTTTGGATrrATCAGGCACAG^ 
AAAAGCAGGAGTTTGTCGCGCGTAAACTGGACGGCAAGGGCTGGACAAAGTATCrrrCCATTAGTGTGC^ 
GGTTGGCCTTTGGCrTCATTrATCAGTCTATTGGTGCCAATCGTCCCTATCGTGAT^^ 
10 TGGGGTAGGGCAAATCCTCATGACAGCTGrnACCGTGAACAGTGGATATrCTGGGCGGCTACC\ATGTC^^ 
ATCrATCTCTGGTGGGGAGAAACCCTGCAAATTCAACGGAAATATCTAATTTATCTCATTAACAGTCTAGT^ 
GGTATCAATCGAGCAAGGCAGCTAAGCAGAATACTGATTTACTTAACTAG 

4170.6 

15 ATGAGAAATATCAAGGCAAAATATGCTGTITGGGTGGCTTTTrrCTTA;^ 

TGCAGGTGG AGTATTrGGTTCT AGCGCTG' 1' lU 1 1 GCTG ACTCTGTGCATG ACTTGGG AG ATGCG ATTGCAATTGG A 
ATATCAGCriUU'CTAGAAACAATCTCCAATCGTGAAGAAGACAATCAGTACACCTTGGGCTATAAGCGCTTTA 
TGCTAGGAGCCTTGGTAACAGCTGTGATTCTCGTAACGGGCTCTGTTCTAGTCATmGGAAAATCT^ 
TTTGCATCCGCAACCAGTCAATGATGAGGGGATTCTCTGGTTAGGAATTATTGCGATTACTATCAATCrGrrAGCG 

20 AGTCTGGTGGTTGGTAAGGGAAAGACAAAGAATGAGTCTATTCTGAGTCTGCATrrrCTGGAAGATACGCTAGGG 
TGGGTAGCTGITATCCTGATGGCGATTG 

CATirCTTTCTrrATTCTTTCAAAAGCCCnTCCACGTTTTrGGTCTACAC^ 

AAGGTCTTGATATCAAGCAAGTAAAGAGTGGCCTGGAGCGATTGGACAATGTGGCCAGCCTTAATCAGCTTAATC 
TCTGGACTATGGATGCmGGAAAAAAATGCCATTGTCCATGTTTGTCTAAAAGAAATGGAACATATC^ 
25 TAAAGAGTCTATTCGAATTTTCCTAAAAGArrGTCGTTTTO^AAATATTACCAT^^ 

ACTCACCi^AACCCATAAGCGAAAGGTGTGTGACrTGGAACGGAGTTATGAGCATCAACATTAG 

4170.8 

ATGATTGAATACAAAAATGTAGCACTGCGCTACACAGAAAAGGATGTCTTGAGAGATGTCAACTTACAGAT^^ 
30 GATGGGGAATTrATGGTTTrAGTAGGGCCrrCTGGCTCAGGTAAGACGACCATGCTCAAGATGATTAACCCTCT^ 
TGGAACCAACTGATGGAAATATTTATATGGATGGGAAGCGCATCAAAGACTATGATGAGCGTGAACTTCGTCTTT 
CTACTGGTTATGTTTTACAGGCTATTGCTCTTTTTCCAAATCTAACAGT^ 

ATGAAGGGGTGGAGCAAGGAAGAAATTACGAAGAAAACAGAAGAGLU"ll"I"GGCTAAGGTTGGTTTACCAGTAGCC 
GAGTATGGGCATCGCTTACCTAGTGAATTATCTGGTGGAGAACAGCAACGGGTCGGTATTGTCCGAGCTATGATTG 
35 GTCAGCCCAAGATTTTCCrCATGG ATG AACC C i IT I CGGCCTTGGATGCTATTTCGAGAAAACAGTTGCAGGTTCT 

GACAAAAGAATTGCATAAAGAGTTTGGGATGACAACGAl ITi IGTAACCCATGATACGGATGAAGCCrTGAAGTT 
GGCGGACCGTATTGCTGTCTrGCAGGATGGAGAAATTCGCCAGGTAGCGAATCCCGAGACAATTTTAAAAGCGCC 
TGCAACAGACTTTGTAGCAGACTTGTTTGGAGGTAGTGTTCATGACTAA 

40 4171. 1 

ATGTCAGCAGTTGCTATTrCAGCTATGACCAAGGTTATGCAAGAAACCCACGGAAATCCTTCTAGTATrCAT^ 
ATGG TCGTCA AGCTGGCAAACTCTTGCGAGAAGCCCGTCAGGAACTAGCCCAGTTACTAAGGACAAAACCTCAAC 
ATATCTTTTTCACTTCTGCTGGGACTGAAGGC^^ATAATACTACCATCATTGGCTACTGTCT^ 
AGGAAAACATATCATCACAACTGCCATCGAGCACCATGCTGTCCrrGAAACAATTGATTACTTGGTT^ 
45 GGGTTTGAAGCAACCATTATCCAGCCAGAAAATCAAGAAATCACAGCCCAGCAAATTCAj^^ 

GATACGATrrTGGTTTCTACCATGTTTGTCAATAATGAGACAGGAAACCTACrGCCCATCGCTGA^ 

TACTCAAGCAA CACCC TGCTGCCTATCATGTTGATGCAGTTCAGGCTATTGGTAAAATCCCAATrCATTCAGAAGA 

ATTGGGCATTGATTTTCTCACTGCTTCTGCCCACAAATTCCATGGTCCTAAGGGAATC^ 

GCATGGACTTTGATTCCTATCTACATGGCGGAGACCAGGAACAGAAAAAACGTGCAGGAACTGAAAATCrGCCT^ 
50 CCATTGTAGGCATCGTTGCAGCCCTAAAAGAAGACCTAGAAAAACAAGAAGAA<^ 

TAGAAACTGCCnTrCTGGCAGAGCTGGAGGGCATTCAGTATTACCTGAATAGAGGAAAACATCATCTCCCTrA 
TCTCAATATTGGATTTCCTGGTCAGAAAAATGACCTCTTACTCCTTCGGCTAGATTTAGCT 

CTGGCTCAGCCTGTACTGCAGGCGTTGTCCAATCCAGCCATG! JCriGAAGCCATGTATGGCGCAAATTCAGAACG 
CrrGAAGGAATCCCTTCGCATCAGTTTGTCGCCACAAAATACCGTTGAAGACCTACAAACCCTCGCAAA^ 
55 AAAGAAATTATCGGAGGTTAG 



60 4172.1 

ATGTTATTCAAATT ATCTA AGGAAAAAATAGAGCTAGGCTrATCTCGTTTATCGCCAGCCCGTCGTA li ' rLl - ^ 
GT TITG CCTTGG TCATT TTACTAGGCTCTiJI lCrrin GAGCnTGCCCTTTGTCCau\GTTGAAAGCTCACGAGCGACT 
TATTTTGATCATCrmCACTGCTGTCTCTGCAGTCTGTGTGACGGGTCTCTCAACCC^ 
TAATATCTGGGGTCAAATAATCTGTITGCTCTTGATTCAGATCGGTGGTCTAGGGCTCATGACCTT^ 
65 TCTATATCCAGAGCAAGCAAAAGCTTAGTCrrCGTAGCCGTGCAACTATTCAGGATAGTTTTAGTTATGGAGAAAC 
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TCGATCmGAGAAAGTTTCrrCTArrCTATnTTCTCACGACCTTm 

GTTTTCCCCTTATrCCTCAACTTGGCTGGGCACGTGGTCTTTTTAG^^ 

AATGCCGGTTrrGATAATrrAGGGAGCACCAGTTTATTTGCTTTTCAGAC^ 

AGGCTTGATTATTACAGGCCGCCTTCO 1 ri i' ATGGTCTGGTTTGATTTGGCTGGTCATGTAGGAAGAAAGAAAAAA 
5 GGACGTCTGCACTITCATACGAAGCnTGTACTATTATTGACTATAGG nr c i ' i ' i GTTAnTGGAACA GCAACTA CrCT 

Cl ' llCr r GAGTGGAACAATGCTGGAACGATTGGCAATCTCCCTGTTGCCGATAAG G ' I ' riU AGTTAGCr 1 1'i 1 i'CAA 
ACAGTGACGATGCGAACAGCTGGCl i i i CTACGATAGATTATACTCAGGCTCATCCTGTGACT Ci 1 i I GATTTATA 
TCTTACAGATGTITCTAGGTGGGGCACCTGGAGGAACAGCTGGGGGACTCAAGATTACGACATT^^ 
GGTCTrrGCACCAAGTGAGCrrCTAGGCrrGCCTCATGCCAATGTTGCGAGACGAACGATC^ 
10 CAAAAATCCTTT AGTG TCTn'ATTAT CI T 1 1 TGATGAG C 1 1 CTi GATAGGATrGATTCTGCTAGGGATAACAGCCAA 

AGGCAATCCTCCCrrrATCCACCTCGTATTTGAAACCATTTCACCTCTTACTACAGTTGGTGTAACGGC^^ 
ACTCCTGACCTTGGGAAATTGGCTCTCAGTGTrATCATGCCACTTATGTTTATGGG 

ACGAATTGGTCCCTTGACCi iUl i IGTTAGCTTGGCAGATTACCATCCAGAAAAGAAAGATATGATTCACTATATG 
AAAGCAGATATTAGTATTGGTTAA 

15 

4172.2 

ATGTCAGATCGTACGATTGGAATTTTGGGCTTGGG AA I' 1 ' 1' 1 rGGGAGCAGTGTCCTAGCTGCCCTAGCCAAGCAGC 
ATATGAATATTATCGCTATTGATGACCACGCAGAGCGCATCAATCAGTTTGAGCCAGTnTGGCGCGTGGAGTGAT 
TGGTGACATCACAGATGAAGAATTATTGAGATCAGCAGGGATTGAIACCTGCGATACCGTTGTAGTCGCGACAGG 
20 TGAAAATCTGGAGTCGAGTGTGCTTGCGGTTATGCACTGTAAGAGTTrGGGGGTACCGACTGTTATTGCTAAGGTC 
AAAACTCAGACCGCTAAGAAAGTGCTAGAAAAGATTGGAGCTGACTCGGTTATCTCGCCAGAGTATGAAATGGGG 
CAGTCrCTACCACAGACCATTLl 1 U CCATAATAGTGTTGATGTCTTTCAGTTGGATAAAAATGTGTCTATCGTGG 

agatgaaaattcctcagtcttgggcaggtcaaagtctgagtaaattagacctccgtggcaaatacaatctgaa 
ttttg ggtt tccgagagcaggaaaattccccattggatgttgaatttggaccagatgacctcttgaaagcagatac 
25 ctatattttggcagtcatcaacaaccagtatttggataccctagtagcattgaattcgtaa 

4172.3 

atgaagttattgtctatcgcaatitctagctat^ 

ttggtggtgagcaagttgggattttgattatcaatgacgggtctcagcatcagactcaggaaatcgctgagtgti^ 
30 agctagcaagtatcctaatatcgt taga gccatctatcaggaaaataaatgccatggcggtgcggtcaatcgtgg 
cttggtagaggcrtctgggcgctattttaaagtagttgacagtgatgactgggtggatcctcgtgcct 

ATTCTTGAAACCn*GCAGGAACTTGAGAGCAAAGGTCAAGAGGTGGATGTCTTTGTGACCAATTTTGTCTAT^ 
AGGAAGGGCAGTCTCGTAAGAAGAGTATGAGTTACGATTCAGTCTTGCCTGTTCGGCAGA l ' ITr ' i GGCTGGGACCA 
GGTCGGAAATTTCTCCAAAGGCCAGTATACCATGATGCACTCGCTGATTTATCGGACAGATTTGTTGC^ 
35 CAGTTCTAA 

4172.4 

atgaaattcaatccaaatcaaagatatactcgttg 

tggctagtggcttcrrrgtcctagttggtcagccaagttctgtacgtgccgatgggctcaatccaaccccaggtca 

40 agtcttacctgaagagacatcgggaacgaaagagggtgacttatcagaaaaaccaggagacaccgttctcactca 
agcgaaacctgagggcgttactggaaatacgaattcacttccgacacctacagaaagaactgaagtgagcgagga 
aacaagcccrtctagtctggatacactttttgaaaaagatgaagaagctcaaaaa^ 
cttaaaagaaactgtagatacagctgatgtggatgggacacaagcaagtccagcagaaactactcctgaacaagt 
aaaag gtgg agtgaaagaaaatacaaaagacagcatcgatgttcctgctgcttatcttgaaaaagctgaagggaa 

45 aggtcctttcactgccggtgtaaaccaagtaattccttatgaactattcgctggtgatggtatgttaact^ 
ttactaaaagcttcggataatgctccttggtcrgacaatgctactgctajwuw^.tcctgc^^ 
gattaacaaaagggaaatacrrctatgaagtagacttaaatggcaatactgttcgtaaacaaggtc^ 
ttgatcaacrrcgcgctaatggtactcaaacttataaagctacrgttaaagtttacggaaataaagac^ 
tgacttgactaatctagttgctactaaaaatgtagacatcaacatcaatggattagttgctaaagaaacag^^ 

50 aaagccgttgcagacaacgttaaagacagtatcgatgttccagcagcctacctagaaaaagccaagggtgaaggt 
ccattcacagcaggtgtcaaccatgtgattccatacgaactcttcgcaggtgatggcatgttgactcgtct 
tcaaggcatctgacaaggcaccatggtcagataacggcgacgctaaaaacccagccctatctccactaggcgaaa 
acgtgaagaccaaaggtcaatacttctatcaagtagccrrggacggaaatgtagctggcaaagaaaaacaagcgc 
tcattgaccagttccgagcaaatggtactcaaacrtacagcgctacagtcaatgtctatggtaacaaagaccgta 

55 aaccagacttggacaacatcgtagcaactaaaaaagtcactattaacataaacggtttaatttctaaag 
ttcaaaaagccgttgcagacaacgttaaagacagtatcgatgttccagcagcctacctagaaaaagcc^ 
aaggtccarrcacagcaggtgtcaaccatgtgattccatacgaactcrrcgcaggtgatggtatgttgactcg^ 
cttgctcaaggcatctgacaaggcaccatggtcagataacggtgacgctaaaaacccagccctatctccacragg 

TGAAAACGTGAAGACCAAAGGTCAATACTTCTATCAATTAGCCrrGGACGGAAATGTAGCTGGCAAAGAAA^ 

oU agcgctcattgaccagttccgagcaaacggtactcaaacttacagcgctacagtcaatgtctatggtaac^^ 
cggtaaaccagacttggacaacatcgtagcaactaaaaaagtcactattaacataaacggtttaatt^ 
aacacttcaaaaagccgttgcagacaacgttaagacagtatcgatgttccagcagcctacctag 



4172.5 
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ATGAAACTAAAAACTTATATmGGTrGGATAT ATTATTT CAACCCTCTTAACCATm 1 1 i IG GGCTGT 

TCAAAAAATGCTGATTGCGAAAGGCGAGATTTACTTTT^ 
GGGATTAGmCTTTCTCCTATTGCCAGTCTrrACGTCGTTGGG 
CCAAGGATTTTCCrrCAAATTTGGAGGTTCAAGGTCCT^ 

GTCCCATGATTTGCAGGTAAGCTTTGATTCCnTGGAAGAAAGCGAACGAGAAAAGGGCTrGATGATO 
GTCGCATGATATTAAGACTCCTATC>.CTTCGATCCAAGCGACGGTAGAAGGGATTTTGGATGGGArrATC^ 
GTCGGAGCAAGCrCATrATCTAGC^ACCATTGGACCCCAGACGGAGAGGCTCAATAA^ 
TTTTTTGACCCTAAACACAGCTAGAAiVrC 

ATTGAGTGCATGAGTGAATTTCAGTTTITGATTGAGCAGGAGAGAAGAGATGTCCACTTGCAGGTAATCCC^^ 

CTGCCCGGATTGAGGGAGArrATGCTAAGCrrrCTCGTATCTTGGTGAATCTGGTCGATAACCC^ 

TGCTCCAGGAACCAAGCTGGAAGTGGTGGCTA AGCT GGAGAAGGACCAGCrrrCAATCAGTGTGACCGATGAAGG 

GCAGGGTATTGCCCCAGAGGATTTGGAAAATATriTCAAACGCCTTTATCGTGTCGAAACrrra 

ACAGGTGGTCATGGATTAGGACTTGCGATTGCGCGTGAATTGGCCCATCAATTGGGTGGGGAAATCACAGTCAGC 

AGCCAGTACGGTCTAGGAAGTACCTTTACCCTCGTTCTCAACCrCTCTGGTAGTGAAAATAAAGCCT^ 

4172.6 

ATGrrrGGTCAAACGGCTCAACATCGTCrrACGAATAGCCTGAAAGAC^ 

CACAATTGGCG i i i i 1 i iGCCAGATGCTCCGCTGTTCCAGATCGGTTGAGCAGGGTACTGGAAATCACCGTCGTGA 

GTTCAATATGATTCAGCAGATATTCTCGCATTTTGGGATGACTCACTTGGGACAAATCAAGTT^ 

TCGATrGACCTTGAGTTGCTGGTCAATGCACTTAATCATCACTTGCTCATTGACAGACTGGTCCT^ 

AAATAACGATAGAAATCGACAGGCAGATAGTACATGGTCTTGACCTGCTGAAGGGGCGTAAAGACAAAGAGA^^ 

TCGACATAAAAAGTATGTrCAGGCAGTTAGAACTGGCTAGCACGCAACAAATCTGTCCGATAAATCAGCGAGTC 

ATCATGGTATACTGGCCrrrGGAGAAATTTCCGACCTGGTCCCAGCCAAAAATCTGCCGAACAGGCAAGACT^ 

4174.1 

ATGGAACATTTAGCAACrrATTTITCAACCTATGGAGGAGCTTTOT 

CTCAAi^GTTTGCCTCAGCTrrGATATr^ 

CTATTGTAGGATACTTTTCAGCTAAGCATCAAGGAAATGT^^ 

CATTTTGACCCTTCGTGTATAA 
4175.2 

AAGACAGCATGGGACTTTACGATGTTTGAGGAAAAATCTTAA 
4175.3 

CAAGG^C^TO 
4174.4 

GCGA^^I??^^,^ 

I^^gISS.^^^^^ 
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ATGAAAATTCAAGACCTATrGAGAAAAGATGTCATGTTGCTAGATTTGCAGGCAACTGAAAAAACAGCTGTCATC 
GACGAGATCATTAAAAATTTGACAGACCACGGTTATGTAj^CAGATm 

CGTGAAGCnTTGACTTCTACTGGTTTGGGTGATGGAATCGCAATGCCTCACAGCAAAAACGCTGCT 

GCGACACTTCTATTTGCTAAGTCAAATAAGCGTGTTGACTACGAGAGCTTGGATGGACAAGCAACTGACCT 

TCATGATrGCAGCTCCAGAAGGTGCCAATGATACTCACTTGGCAGCCTTGGCAGAATTGTCTCAATA 

AGACGGTTTTGCAGACAAACTTCGTCAAGCAACATCTGCAGACCAAGTTATCGAA Ci ' ri ' l I GACCAAGCTTCAGAA 

AAAACTGAGGAACrrGTTCAAGCACCTGCTAATGACTCrGGTGACTTTATCGTAGCTGTTA^ 

GTATTGCCCACACTTACATGGCCCAAGAAGCCCTTCAAAAAGTAGCTGCTGAAATGGGGGTTGGTATCAAGGTCG 

AAACCAACGGTGCTAGCGGTGTTGGAAATCAACTAACTGCAGAAGATATCCGTAAGGCTAAAGCTATTATCATT 

CAGCAGACAAGGCCGTTGAAATGGATCGATTTGATGGAAAACCATTGATCAATCGTCCAGTTGCTGACGGTATCC 

GTAAGACAGAAGAGCTAATTAACITGGCTCTTTCAGGAGATACTGAAGTCrACCGTGCCGCTAATGGTGC 

CTGCAACAGCCTCTAACGAAAAACAAAGCCTTGGTGGTGCCTTGTACAAACACrrGATGAGTG 

TGTTACCATTCGTTATCGCTGGTGGTATCATGATTGCCCTTGCCTTCTTGATTGACGGTGCTTT 

GAAAACCTTGGCAiVrCTTGGTTCTTACCATGAG^ 

GATGCTTCCAGTCTTTGCGGGTTATGTTGCCTACTCTATTGCT 

GGTGCTATTGCCAAAGAAGGTTTTGCCTTTGGTAAAATTCCTTATGCCGCAGGTGGTGAAGCA^ 

CAGGTGTCTCATCTGGTTTCCTAGGTGCCCTTGTTGGTGGATTTATCGCAGGTGCCTrGGTTCT^ 

TACGTTAAAGTTCCTCGTTCACTCGAAGGTGCTAAATCAATCCTTCT-ATTGCCA Ci 1 Ci t GGAAC AATCTTGACAG 

GATTTGTTATGCTAGCTGTGAATATCCCAATGGCTGCAATCAACACTGCTATGAATGACTTCCtAGGCGGTC^ 

AGGAGGTTCA GCTGT CCrrCinGGTATCGTCCTTGGTGGAATGATGGCTGTTGACATGGGTGGACCAGTTAATAAA 

GCAGCTTATGTCTTTGGTACAGGTACGCTTGCAGCAACTG 1 I ' i C n CAGGTGGTTCTGTAGCCATGGCAGCAGTTA 

TGGCTGGAGGAATGGTGCCACCACTTGCAATCTTTGTCGCAACTLI ILI I'MCAAAGATAAATTTACTAAGGAAGA 

ACGTAACTCTGGTTTGACAAACATCATCATGGGCTTGTCATTTATCACTGAGGGAGCGATTCCATTTGGTGC 

GACCCAGCTCGTGCGATTCCAAGCTTCATCCTTGGTTCAGCAGTAGCAGGTGGACTCGTTGGTCTTACTGGTATCA 

AACTCATGGCGCCACACGGAGGAATCTTCGTTATCGCCCTTACnTCAAATGCTCrCCTTTACCTCGm 

GTAGGAGCAATCCTAAGTGGTGTGGrTTATGGTTACCTACGCAAACCACAAGCATAA 

4175,6 

ATGGCAAACAAGAATACAAGTACAACAA GACG GAGACCGTCTAAAGCAGAACTGGAAAGAAAAGAAGCGATTCA 

ACGA ATG TTGA TTTCGTTAGGAATTGCGATTTTATTGATTTTCGCAGCCTFCAAATTAGGGGCT 

CTTTATAATTTAATTCGCTTGCTAGTGGGTAGCCTAGCTTATCT 

TTTCAAGTGGATACGAAAACAGGAAGGACTCTTATCTGGCTTTTTCACCATATTTGCTGGCTTACT 

AGGCCTACTTGGTTTGGAAATATGGTT^ 

GACTGGTTTTCGAACGACTAGam 

TTTrCAAATATCGGA ACTTA CTTTATTGGTTCTATCTTGAl 1 i'l'AGTGGGTTCTCTCCTAGTCAGCCCTTGGTCTGTT 

TACGATATTGCTGAATTT^^CAGTAGAGGCTTTGCCAJ^ATGGTGGGAAGGGCACGAGCGTCGAAJ^ 

TTTGTCAAACAAGAAGAAAAAGCTCGCCAAAAGGCTGAGAAAGAGGCTAGATTAGAACAAGAAGAGACTGAAA^ 

AGCCTTACTCGATTrGCCTCCTGTTGATATGGAAj^CGGGTGAAATTCTGACAGAGGAAGCTGTTCAAAAT^ 

CCTATTCCAGAAGAAAAGTGGGTGGAACCAGAAATCATCCTGCCTCAAGCTGAACTTAAATTCCCTGAACAGGAA 

GATGACrCAGATGACGAAGATGTTCAGGTCGATTTTTCAGCCAAAGAAGCCCTTGAATACA^ 

CAACTCTT TGCAC CAGA TAAA CCAAAAGATCAGTCTAAAGAGAAGAAAATTGTCAGAGAAAATATCAAAATCT^ 

GAAGCAACCTTTGCTAGCTTTGGTATTAAGGTAACAGTTGAACGGGCCGAAATTGGGCCATCAGTGACCAAGTAT 

GAAGTCAAGCCGGCTGTTGGTGTAAGGGTCAACCGCATrrCCAATCTATCAGATGACCTCGCrCTAGCCTTGGCTG 

CCAAAGAT GTCCG GATTGAAGCACCAATCCCTGGGAAATCCCTAATCGGAATTGAAGTGCCCAACTCCGATATTG 

CCACTGTATCTTTCCGAGAACTAT GGGAA CAATCGCAAACGAAAGCAGAAAATTTCTTGGAA^ 

AGGCTGTTAATGGAACCGCAAGAGCTTTTGACCTTTCTAj^AATGCCCCACT^ 

AGGGAAGTCAGTAGCAGTTAACGGCATTATTGCTAGCATTCTCATGAAGGCGAGACCAGATCAAGTTAAATTTAT 
GATGGTCGATCCCAAGATGGrrGAGTTATCTGTTTACAATGATATTCCCCACCTCTTGATTCCAGTCGTGACCAAT 
CCACGCAAAGCCAGCAAGGCTCTCCAAAAGGTTGTGGATGAj^ 

GGAGrrCGGAATATTGCAGGTTTTAATGCCAAGGTAGAAGAGTTCAATTCCCAGTCTGAGTACAAGCAAATTCCO 
CTACCATTCATTGTCGTGATTGTGGATGAGTTGGCTGACCTCATGATGGTGGCCAGCAAGGAAGTGGAAGATGCTA 

tcatccgtcttgggcagaaggcgcgtgctgcaggtatccacatgattcttgcaactcagcgtccatctgttgatgt 

c atct ctggtttgattaaggccaatgttccatctcgtgtagcatttgcggtttcatcaggaacagactcccgtacg 

atrrtggatgaaaatggag cagaa aaacttcttggtcgaggagacatgctctttaaaccgattgatga/^ 

ccagttcgtctccaaggc tccttt atctcggatgacgatgttgagcgcattgtgaacttcatcaagactcaggcag 

atgcagactacgatgagagtttrgatccaggtgaggtt^ 

GTGGTGATCCGCITTTTGAAGAAGCTAAGTCTTrGGTTATCGAAACACAGAAAGCCAGTGC 

TCGTTTATCAGTTGGATTTAACCGTGCGACCCGTCTCATGGAAGAACTGGAGATAGCAGGTGTCATCGGTCCAGCT 
GAAGGTACCAAACCTCGAAAAGTGTTACAACAATAA 

4176-1 

ATG AGTTA TTTTAAAAAATA TAAAT TCGATAAATCCCAGTTCAAACTrGGTATGCGAACCTTTAi^ 
CTGTTTTTCTAGTT CTCr rGATTTrrGGCTTTTT^ 

GCCTGAGGGAGAGTTTTGATGAGAGTCTTCATTTTGGGACTTCGCGTATTCTAGGAAATAGTATCGGTGGACTCTA 
TGCCrrGGTCTTCTTOTATTAAATACCTTnTCCACGAAGCCTTTTGGGTGACOT 
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CCATGTTAACCATTATGACAAATCrrAGCCATGAATAACAAAGCAGGGGTTATTGGTCGTGTAGC^^ 

CATTACCCTATCAATTCCAAGTGGTGAGACAATTTrGTACGTCTTTGTGCGTGTATTAGA^ 

TTTGTCGCAATTATCGTAAATTACCATATTGATCGTATTCGTCTCTTm 

4178.2 

ATGAATAAATCAGAACACCGCCACCAACTTATACGCCCTCTTATCACAAAAAACAAGATTCATACACAGGCT^ 

TTGCAAGCCl-i lui iGCTGAGAACGACATTCAAGTAACCCAGGCAACCCTCTCACGCGACATCAAAAATATGAAC 

CTATCAAAAGTCCGCGAAGAAGATAGCGCTTATTATGTTCTTAACAATGCTTCCATCTCAAAATGGGAAAA^ 

TCGAACTCTACATGGvyiGACGCCCTTGTCTGGATGCGCCCAGrrCAACACCAAGTCCT^ 

ACTGGCTCAATCCrrrGGTTCrATCATTGATACmGAGCTTCCCTGACGCTATCGCrACCC^ 

TCTGTCTTATCATCTGTGAAGATGCAGATACTGCTCAAAAGTGCTTTGAAGAACTGAAAAAArrCGCC^^ 
TTTCTTTGAAGAATAA 



4179.1 

15 ATGAAAAGTATAAAj\rrAAATGCTCTATCTTACATGGGAATTCGTCTCTTGAATATTATT^ 
AACCTVapTCGCGCGTGTCTTGGACCGAACTGACTATGGTTACTTCAACTCA 
TGCCCTTTGCAACTTATGGT^^ 

TAACAGAAC un i i CTAGTCi im i ATrrGTGCATCGCTTGTACGATTTTGACCACTGCTGTCTATATCCTAGCCT 
on ATCCTCTui iL.ii i ACTGATAATCCAATCGTCAAAAAGCTCTACCTtGTTATGGGGATrCAACTCATTC 

lU rmCAATCGAATG GGTC AATGAAGCTCTGGAAAATTACAGTTTTCTCITTTA 

GATGCTGGTCTCTAI i i iCi i ATTrCTrAAAAATGAACACGATATTGTTGTCTATACACTTGTGATGAGrTTATCGA 

CGCTGATTAACTACCTGATTAGTTATTTTTGGATTAAAAGAGACATCAA^ 

TAAACCACTCTTTCTCCCTCTGACAGCCATGTTAGTCrrrGCCAATGCCAATATGCT 

"T'CTTCCTCGTTAAAACAGGGATTGATGTCAACGTTAGTTACrATACCATAGCTCAGCGAATTCT^ 

TGGGGrrGTAACAGGTGCAATTGGAGTGAGTGTGCCTCGTCTCAGTTACTATCTGGGGAAAGGAGACAAAGAAGC 

CTATGTTTCTCTGGTTAATAGAGGTAGTCGAATCriTAACTTCTTrATCATTCCAC^^ 

TAGGACCAAATGCCATCCTACmACGGTAGTGAAAAATATATCGGAGGCGGCATCTTGACCTCTCTCTTCGC^ 
TCGTACGATTATCCTGGCCTTAGATACCATTCTTGGTTCCCAAATTCTCTTTACCAATGGCTATGAAA^ 
ACAGTCrATACAGTCTTTGCTGGGCTACTCAATTTGGGCTTGAATAGTCTC CI I nTn -CAACCATATCGTGGCTCC 

30 TCAATACTACTTACTGACAACTATCCTATCAGAGACTTCTCTACTTGTTTrCTATATCAT^ 
AACTCATCCACTTGGGACATATCTTtAGCTATACTGTTCGATACTCrCTCTmCACTT^ 
TCCrGATTAATTTCGTGTATCCTGTAGATATGGTCATTAATTTGCCATTrTTGATrA 
ATACTCGTTTGATTGTCTTGCTATCAGCTATCTCnTATATTAGTCT^ 

^ ^ GAAi t i i i AAACCATGTCCTAGCCTTAAAAAATAAATTTAAAAAATCATAG 

4179.2 

ATGAAACAACTAACCGTTGAAGATGCCAAACAAATTGAATTAGAAATTTTGGATTATATTGATACT 
AGCACAATATCAACTATATTATTAACTACGGTACTCTGATTGGGGCGGTTCGACATGAGGGCTTTAT^^ 

4U GTATAAGCTCCTATCCnTAGAAACTGATAAGAACTACTITAACAACTTTA^^ 

ATTATTGATACTCGAAATACAAAAACCTATGAGTCTGGTATCrrrATCGATAT^ 
TCa-AAGGTCATl^iVIj\CTrG^^ 

TATAAGGATAGCCrrrTAAAAGATTGGATACGAACAGCCTTCTGGTTACTCCTrCGACCGGTrrCT 
A < '^S^^'^^*^^'^^"^ AGAAAGAAATTCAAAAATATAGTCGTGAAAATG^ 

GCTTCTACAGTCATGAArrrCACGCrTATAAATTCGAGGATTAG 



4179.3 

IgSgaZ^^^ 

AGmGGTTCAAAAACTACAACTAACnrrCCGTGGCCAAGAAAAAATCGC^^ 
03 T^GCT^ATGCCACAAGATTACCACAAAAAACTGCAATTGGATTrATCCCCAATAGCCrATCrCAGTAAAACTGGG 
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GAAAAAGAGGAACTACAGAAAATCCAATCTCACCTAGCTAGTCTCAATTTCACrrTATCCAGAAATGCAGCATCA^ 
ATTCGCTCCrrATCrGGCGGACAACAGGGAAAACTCCTGCTTTTGGATTrACT^^ 
TGCTGGATGAACCCACACGAAACTTTTCTCCCACTTCTCAACCCCAAATCAGAAAACTC^^ 
CGGTCTCATCACTGTTTCGCATGACCGTCGlUM'CriAAAAGAAGTCTGCTCGATCATCTATCGCATGACAGAACAC 
5 GGTTTGAAGCTAGTTAATTTAGAAGATTTATAA 

4179.4 

ATGAAAC CAAAA ACATTTTACAACrrGCrrGCCGAGCAGAATCTTCCACTTTCGGACCAGCA^ 

GAACGTTATTTTGAGCTCTTGGTCGAGTGGAAT^ 
10 TATCTCAAACATTTTrACGATTCGATTG(:j\CCCATTCTTCAAGGTT^ 

TATCGGGGCTGGGGCAGGATTTCCTAGTCTACCAATGAAAATTCTCTATCCGGAGTTAGATGTGACCATTATTGAT 

TCACTCAATAAGCGCATCAACTTCCTACAACTCTTGGCTCAAGAACTGGATTTG 

GACGTGCCGAAGATTTTGCCCAAGACAAGAACnTCCCTGCTCAATATGATTTTGTAACAGCT^ 

TATGCAGGTCCTATCTGAATTGACTATTCCCTACCrrAAGGTTGGTGGCAAACTATTAGCACTCAAGGCTAGCAAT 
15 GCGCCTGACGAATTATTAGAAGCTAAGAATGCCCTCAATCTCCTTTTTAGTAAGGTCGAAGA^^ 

CCCTACCGAATAGAGATCCGCGCTATATCACAGTGGTACAAAAGAAAAAAGAAACACCAAATAAATATCCACGTA 

AGGCTGGTATGCCAAATAAACGCCCACTTTAA 

4179.6 

20 ATOAGTATTAAACTAATTGCCGTrGATATCGACGGAACCC7TGTC^^CAGCCA^ 

TTTCTGCCATCCAAGATGCCAAAGAAGCTGGTGTCAAAGTCGTGATTGCAACTGGCCGCCCTATCGCAGGCGTTGC 
CAAACTTCTAGACGACTTGCAGTTGAGAGACGAGGGGGACTATGTGGTAACCTTCAACGGTGCCCTTGTCCA^ 
AACTGCTACAGGACATGAGATTATCAGCGAATCCTTGACITATGACGATTATCTAGATATGGAATTCCTCAGTCG 
AAGCTCGGTGTCCACATGCATGCCATTACCAAGGACCGTATCTATACTGCAAATCGCAATATCGGAAAATACACT 

25 gtacacgaatcaaccctcgtcagcatgcctatcttctaccgtacccctgaagaaatggctggcaaagaaatt^^ 
aatgta'tgtttatcgatgaa ccag aaattctcgatgctgcgattgaaaaaatrccagcagaatm 
ctccatcaacaaatctgctccttrctacctcgaactccttaaaaagaatgtagacaagggt^ 
ttggctgaaaaactcggattgaccaaagatgaaaccatggcaatcggtgatgaagaaaatgaccgtgccatgctg 
gaagtcgttggaaaccccgttgtcatcgaaaatggaaatccagaaatcaaaaaaatcgccaaatacatcaccaaa 

30 acaaatgacgaatccggcgttgcccatgccatccgaacatgggtactgtaa 

4179.7 

atgacttggattattcttggagttatcgctcttattgttatttttgtgattgt^ 

tcgtatgcaaaccaaggaggcttggagtcagattgatgttcagttgaaacgtcgcaatgacctcttgccaaac^ 
35 attgagactgtaaaaggttatgccaaatatgaaggttctacccttgaaaaggtggcagaactacgtaaccaagtg 
gcggcagcgacttcaccagcagaagctatgaaagccagtgatgccctcactcgtcaggtttcaggta i 1 k 1 i gcag 

TTGCAGAAAGCTATCCAGATTTGAAAGCTAGTGCTAACTTTGrrAAATTGCAAGAGGAGTTGACAAACACAGAAA 

AT A AAA r r 1 c n actctcgtcaactctat aacagtgttgtcagcaactacaatct aaaattag aaactttcccg ag 

CAATATTATCGCTGGAATGTTTGGATTTAAAGCGGCAGATrrCCTTCAAACACCTGAAGAGGAAAAGTCGGTTCCT 
40 AAAGTTGATTTTAGCGGTTTAGGTGACTAA 

4179.8 

ATGTTGTTTGATCAAATTGCAAGCAATAAACGAAAAACCTGGATTTTGTTGCTGGT Arr 
TGTTGG TTAT GCGGTTG GTTATC TCTTTATAAGATCTGGACTTGGTGGTTTGGTTATTGCACTGA^ 
45 TCTACGCTTTGTCTATGATTTTTCAATCGACAGAGArTGTCATGTCCATGAATGGAGCGCGTGAGGTGGATGAGCA 
Aj\CGGCACCAGACCTCTACCATGTAGTGGAAGATATGGCTCTGGTCGCTCAGATTCCTATGCCCCGTGTTTTCATC 

attgatgatccagccttaaatgccittgcgacaggttctaatcctcaaaatgcggctgttgctgcgac^ 
tactagctatcatgaatcgtgaagaactagaagctgttatgggacatgaagtcagtcatattcgtaattatgatat 

CCGTATTrCGACTATTGCAGrrGCCCTTGCTAGTGCTATCACCATGCTTTCTAGTATGGCAGGTCGTATGATG^^ 

dU ggggtggagcaggtcgcagacgaagtgatgatgaccgagatggaaatgctcttgaaatcattatgctagtggttt 

CCCTACTAGCTATTGTACTGGCACCTCTCGCTGCAACCTTGGTTCAGCTCGCTATTTCTCGTCAGAGGG^ 

gcagatgcatctagtgtcgagctgactcgcaatccccaggcaatgattaatgccctagataagttggacaatagc 

AAACCTAT GAGTC GCCACGTCGATGATGCTAGCAGTGCCCTTTATATCAATGATCCTAAGAAAGGTGGGGGGTTC 
CAAAAACTcl i i i ATACCCACCCACCTATCTCAGAACGGATTGAACGTTTAAAACAGATGTAA 

4179-9 

ATGAAATTAAATATTCAAGAAATTCGTAAGCAGTCTGAAGGTTTGAACTTTGAACAAACGTTAGATT^ 
ACCTGCGTGCACGTAATCAAGAAATrrrAGATGTAAAAGATATCCTTGCAGTTGGGAAAGTACAATATGAAGACC 
GTATGTAi i i c i i AG ATTATCAACTATCTTATACCATTGTTCTTGCTTCGAGTCGCAGTATGGAGCCAGTTGAGTTA 
OU GTTGAATCTTATCCAGTCACGGAAGTTrrCATGGAAGGCGCAACTAACCAGCTAGATCAAGAAGTTTTAGATO^ 
ACTTGGTCTTGCCCATCGAAAATGGGGAGCTTGACCTTGCTGAGAGTGTATCAGACAATATCCTGCTAAACA^^ 

tatcaaggtcttgacggctgaagaagaagctggtcaaggarrtatctcaggaaatgactggcaaatcatgacaga 
ggaagaataccaagctcaaaaagcagtaaagaaagaagaaaacagtccttttgctggcttacaaggactatt^ 

CGGAGATGAATAA 
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4179.12 

ATGGAGTTATrrATGAAAATCACAAACTATG>^u\ATCTATAAGTrAAAAAAATCAGGT^ 

TGAAAGTGCTAGAATACGGTGAAAATGTTGATCAGGAGCl i i'l GTTGGGTGATATTGCAGATATCTCAGCTTGCCG 
TAATCCAGCCGTTTTTATGGAACGTTATrTTCAGATAGACCATGCGCATTTGTCGAAAGAGTTTCAA^ 
5 TCTTTCTCTATTTTAGATGACTGTTATCCTTGGGArrTCAGTGAAATATATGATGCGCCTGTACi i 1 T ATTTTACAA 

GGGAAATCTTGACCTCCTCAAATTCCCGAAGGTAGCGGTCGTGGGCAGTCGTGCTTGTAGCAAACAGCGAGCT 
GTCAGTTGAAAAAGTCATTCAAGGCTTGGAAAATGAACTGGTTATTGTCAGTGGTCTGGCCAAGGGCATT GACA C 
AGCAGCTCATATGGCAGCTCTTCAGAATGGCGGAAAAACCATTGCAGTGATTGGAACAGGACTGGATGTGTTTTA 
TCCTAAAGCCAATAAACGCrrGCAAGACTACATCGCCAATGACCATCTGGTTCTAAGTGAATATGGACCTGGTGA 
10 ACAACCTCTGAAATTTCATTTTCCTGCCCGTAATCGCATCATrGCTGGACTTTGTCGTGGTGTGAT^ 

GCTAAGATGCGTTCAGCTAGTCTCATTACGTGTGAGCGAGCAATGGAAGAAGGACGCGATGTCTTTGCTATTCCT^ 
GTAGCATTTTAGATGGACTATCAGACGGTTGCCATCATTTGATTCAAGAAGGAGCAAAATTGGTCACCAG^^ 
AAGATUTlCl'i GCGGAATTTG AATTTTAA 

15 418UI 

ATG AAACGTCAATTAGCCTrGGTCGTCTTTAGTGGTGGTCAAG ATTCAACAACCTGC C IV Vi CTGGGTCATGCAAC 
ACTATGAAACAGTCGAAGCTGTCACCTTTGCCTACGGCCAACGTCATCACCrCGAAATTCAAATTACTAGAGAy^ 
CGCTAAGGAACAGGGCATTCGTCACCATATCCTCGATATGTCTCTGCrrGGGACAAATCACTGCTCAGCCAGACTTT 
GCGACGATTCATATTTCCTACATTCCTGACAAGCTCTGTGTCGAGTCAAAATCCCTCAAACTATATCTAT^ 
20 ACCGAAACCACGGAGATTTCCACGAAAACTGTATCAACACCATCGCGAAAGACrTGGTCAACTTGCTAGACCCTC 
GCTAT7TAGAAGTCTGGGGAAAATTCACTCCGCGCGGTGGCATTTCAATCGACCCCTACTACAACTACCCTAAGCA 
AGGAACrrAAGTATGAGGGCTTGGCAGAACAACGCCTCTTCCA^CACGACCT7TATCCAGAGAAj\ATTG 
CTAA 

25 418K2 

ATGACCGAAACGGTAGAAGATAAAGTAAGTCATTCAATTACTGGGCTTGATATCCTCAAGGGGATAGTTGCTGCG 
GGAGCTGTCATAAGTG GAAC CGTTGCAACTCAAACGAAGCTATTTACAAATGAGTCAGCAGTACTTGAAAA 
GTAGAGAAAACGGATGCTTTGGCAACAAATGATACACTAGTTCTAGGTACGATATCTACAAGTAATTCAGCGAGT 
TCAACTACTTTGTCAGCTTCAGAGTCGGCAAGTACATCTGCATCTGAGTCAGCCTCAACCAGCGCTTCGACCTC^ 

30 CAAGTACAAGTGCATCAGAATCAGCAAGTACATCGGCTTCGACAAGTATTTCTGCATCATCTACTGTGGTAGGT^ 
ACAAj\CAGCTGCCGCTACAGAAGCAACTGCTAAGAAGGTCGAAGAAGATCGTAAGAAACCAGCTAGTGATTATGT 
AGCATCAGTTACAA^TGTCAATCTCCAATCTTATGCTAAGCGACGCAAGCGTTCAGTGGATTCCATCGAGCAATTG 
CTGGCTTCTATAAAAAATGCTGCTG [ ri'l'riCTGGCAATACGATTGTAAATGGCGCCCCTGCAATTAATGCAAGTC 
TAAACATTGCTAAAAGTGAGACAAAAGTTTATACAGGTGAAGGTGTAGATTCGGTATATCGTGTTCCAATTTACTA 

35 TAAATTGAAAGTGACAAATGATGGTTCAAAArrGACCTTTACCTATACGGTTACGTATGTGAATCCTAAAACAA^ 
GATCTTGGTAATATATCAAGTATGCGTCCTGGATATTCTATCTATAATTCAGGTACTTCAJ^CACAAACAATGT^ 
CCCTTGGCAGTGATCTTGGTAAACCTTCAGGTGTAAAGAACTACATTACTGACAAj\AAT^ 

CTATAATACATCTACAATGACGACCCAGGGTAGTGGGTATACTTGGGGAAATGGTGCCCAAATGAATGGTTTCTTT 
GCTAAG AAAG GATATGGATTAACATCATCTTGGACTGTACCAATTACTGGAACGGA 
40 TACATCCrrrACATTTACCCCTTACGCTGCTAGAACAGATAGAATTGGAAT^ 

gtagttgaatctagcacgaccagtcagtcactttcacagtctaagtcactctcagtaagtgctagtcaaagcgcct 

CAGCTTCACCATCAACAAGTGCGTCGGCrrCAGCATCAACCAGTGCCTCGGCTTCAGCGTCAACCAGTGCGTCAG 

CTTCAGCAAGTACCAGTGCTTCAGTCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCCTCGGCTTC 

AGCAAGCACATCAGCATCTGAATCAGCGTCAACCAGTGCTTCGGCTTCAGCAAGTACCAGTGCTTCAGCTTCAGC 

45 ATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTrCTGAATCGGCCTCAACCAGCGCCTCGGCCTCAGCAAG 
CACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCGGCTTCAGCAAGCAC 
AAGCCCCTCGCGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCCTCAGCATCAACAAG 
TGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCAGCATCAACGAGTAC 
GTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTC 

50 AGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCGTCGACAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGC 
ATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCGGCTTCAGCATCAACCAGTGCCTCGGCTTCAGCGTCA 
ACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGfCTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACA 
AGTGCCTCGGCTTCAGCAAGCACATCAGCATCTGAATCAGCGTCGACAAGCGCCTCAGCTTCAGCAAGTACCAGT 
GCGTCAGCTTCAGCATCAACCAGCGCCTCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCC 

55 TCGGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGCGCCTCAGCCTCAGCATCAACGAGTGCTTCG 
GCTTCAGCAAGCACAAGCGCCTCGGGTTCAGCATCAACGAGTACGTCAGCTTCAGCGTCAACCAGTGCTTCAGCC 
TCAGCATCAACAAGTGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGTGCGTCTGAGTCA 
GCATCAACGAGTACGTCAGCCTCAGCAAGCACCTCAGCTTCTGAATCGGCCTCAACCAGTGCCTCAGCCTCAGCA 
TCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCTTCAGCCTCAGCTCGACAAGTGCGTCGGCCTCAACCAGTG 

OU CATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCAT 
CGGCTTCAGCATCAACCAGTGCCTCGGCTrCAGCGTCAACCAGTGCGTCAGCTTCAGCAAGTACCAGTGCTTCAGT 
CTCAGCATCAACAAGTGCTTCAGCCTCAGCATCGACAAGTGCcTCGGCTTCAGCAAGCACATCAGCATCTGAATCA 
GCGTCGACAAGCGCcTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGACAAGTGCGTCAGCCTCAGCA 
AGTACTAGTGCATCAGCTTCAGCATCAACGAGTGCATCGGCTTCGGCGTCAACCAGTGCATCAGAGTCAGCAAGT 

05 ACCAGTGCGTCAGCrrCCGCATCAACAAGTGCCTCGGCTTCAGCAAGCACCAGTGCGTCGGCrrCAGCAAGTACT 
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AGCGCCTCAGCCTCAGCCTCAACCACrrGCGTCAGCCTCAGCAAGTATCTCAGCGTCTGAATCGGCATCAACGAGT 

GCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCTCACCGTCAACAAGTGCATCGGCTTCAGCGTCAACGAGT^ 

TCTGAATCGGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCrCAGCGTCAACAAGTGCATCG 

GCTTCAGCATCAACGAGTGCGTCCGCTTCAGCAAGTACTAGCGCCTCAGCCrCAGCGTCAACAAGTGCATC^ 

CAGCGTCAACGAGTGCGTCTGAGTCAGCATCAACGAGTGCGTCAGCCTCAGCAAGCACATCAGCrrCTGAATCTG 

CATCAACCAGTGCGTCAGCCTCAGCATCGACAAGCGCCTCAGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCGT 

CGACAAGTGCGTCGGCTTCAGCAAGTACCAGTGCGTCAGCCTCAGCAAGTACCAGTGCGTCAGCCTCAGCGTCGA 

CAAGTGCGTCGGCCTCAACCAGTGCATCTGAATCGGCATCAACCAGTGCGTCAGCCTCAGCAAGTACTAGTGCAT 

CAGCTTCAGCATCAACGAGTGCATCGGCTTCAGCATCAACCAGTGCATCAGAGTCAGCAAGTACCAGTGCGTCAG 

TTCCGCATCAACAAGTGCCTCGGCTTCAGCAAGTACTAG 

4183.2 

ATGGGGGTCGAAACTTGGTTTTATrCTAGCATCTGCTGGCTGGCCATCGGGCTTGGTTCCGT^ 

ACATGACTGCTGCTAATGGCGGTGG AGGC 1" i ' I ' I " 1' ACT AATCTTTCTCATTTCCACTATTTT AATCGGTTTCCCTCTC 

CTGCTGGCTG AGTTT GCCCTTGGCCGTAGTGCTGGCGTTTCCGCTATCAAAACCTTTGGAAAACT 

ACAAGTACAACTTrATCGGTTGGATTGGCGCCrTTGCCCTCrrTATCCTCTTAT Ci 1 i i I ACAGTGTTATCGGAGG A 

TGGATTCTAGTCTATCTAGGTATTGAGTTTGGGAAATTGTTCCAACTTGGTGGAACGGGTGATTATGCTCAG^ 

TTACTTCAATCATTTCAAATCCAGCCATTGCCCTAGGAGCTCAAGCGGCCrTTATCCTATTGAAT^ 

TCACGTGGGGTTCAAAAAGGGATTGAAAGAGCTTCGAAAGTCATGATGCCCCTGCTCTTTATCGTCm 

TCATCGGTCGCTCTCTCAGTTTGCCAAAT<K:CATGGAAGGGGTTCrrr'ACTTCCTCAAAC 

GACTAGCACTGGTCTCCTCTATGCTCTGGGACAATCTTTCTTTCCCCTCTCACTAGGGGTT 

ATGCTTCTTACTTAGACAAGAAAACCAATCTAGTCCAGTCAGGAATCT 

CATCATGGCAGGTCTAGCCATTTrCCAAGCTCGATCCCCCrTCAATATCCAGTCTGAAGGGGGACCCAGCCTGCTC 
TTTATCGTCTTGC CTC AAC TCTTTGACAAG ATGCC T i ' i ' i GG AACCAr^^TCTACGTCCTCTTCCTCTTGCTCTTCCTT 
TTTGCGACAGTCAC 1 1 i i iCTGTCGTGATGCTGGAAATCAATGTAGACAATATCACCAACCAGGATAACAGCAAAC 
GTGCCAAATGGA GTGTTA TTTTAGGAATTTTGACCTTTGTCTTTGGCATTCCTTCA^ 

GCGGATGTTC ACAT TT TTGGTA AGACLl ICl 1 IGACGCTATGGACTTCTTGGTTTCCAATCTCCTCATGCCATTrGG 
AGCTCTCTACCTTTCACTTTTTACAGGCTATATCTTTAAAAAGGCTC^ 

AGAGCATGGAAACAAGGACTGTTCCAAGTCrGGCTCTTC Ul iLt i C GI ' i 1 Li r CGTrTCGTCATTCCAATCATCATC 
ATTGTGGTCTTCATTGCCCAATTTATGTAATCAAAAAGGACTTGAGTAG 

4183.5 

ATGTTGAAAAAATGGCAGTTAAAAGATGTTATCTTGCTTG C 1 I ' l Cri GTCTAT C i i ' ITlT GGTGGG G ' rri 1 CGTTGG 

TTCAGGATATGTGTATAATATTCTCAGTCTACTCTTAACACCTCTTGGTTTGCAGGCCTTTGC^^ 

TCGGTCTCTGGTGTATGGCTGC GCCC ATTGCTGCCATCTTTGTTCCGAGAGTCGGAAGTGCAACGATTGGAGAAGT 

GCTAGCTGCGCTTGCT GAAG TCCTTTATGGTAGCCAATTTGGTCTAGGAGCTCTTTTGTCTGG 

TGGGAAGTGAATTTGGTTTTATCGTAACTAAGAATCGCTATGAAAGTTGGCTCTCTCT 

GATTACGCTTGT TAGC TTTGTCTATGAATACATTAAGTTAGGTTACTACGCCTTrrCCCT^ 

GCTTGTGGTACGTTTTATTTCTGTTTATITCTTCTGTACCATCrrGGTTCGTGCC^^ 

TGCAACTGGAGGAAAAGCATAG 

4183.6 

ATGGTCAAAGTAGCA ACCCA GACACCG ATTATCAGTCTC I 1 C IM'GCTG ATTTTATCCTTGG AA ACATCTTTCATTCC 
TTCGATTGCTCTGACTCTTTCGGTAGTCGCATTTTGTATT 

CTTGGATGATCATACnTGCCATTTT ACCA TCTTrTGCCAACTACTGGGCAGTTCAGTT^ 
GGCAGTCATGCTTGGAACGAGGCCCTTTGTGACAGTTTGTATCGGCCTTGTCTTTGTTTCCTCT 
AGCTTCTCTTGTACITGGCTCAAAAGGGGCTATCACGCTCTTGGTCCTATGCCTTGATTGTGGT^ 
CCTCTCATTCAGCAAGAAATCAAGTCCCTCAAAGAAGCTTGCCTATTACGTGGTCAAG^ 

CCTrGATTTACAGTAAGGTTCTGATGACAGTCTTTAGGTGGCGCCATCTTTACCTGAGAGCTCTATCTGCTCACGG 
ATATGACGAA CATGCACAGTTGAAGAATAGCTATCGGACTTTTTATATTCCTAAAAAAACAAAATTAATCTACCT 
CTTTTCTTTTTATTCCTTCAAACCAGTCTATTTTTATAA 

4183.7 

ATGA GAAA GCACCAATTACAAGTTCACAAATTAACCATTTTATCTATGATGATTGCCCrrGATGTAGTCCr^ 

CTATCTTTCGAATTGAGGGAATGGCACCCATGTCCAGTGTAGTCAATATTCTAGCAGGAATCATGATGGGACCTGT 

TTATGCCTTGGCrATGGCTACAGTCACAGCCTTT^ 

GGA GCGA Ci 1 I iGGAGCCCTTCTAGCAGGTCTCTTTTATAAGTACGGTCGAAAATTTCACTATTCTGCTCTAGGAG 

AGATTTTGGGAACAGGTArrATTGGTTCCATTGTTTCCTATCCTGTTATGGTACTCITTACAGGAT^ 

CTTV^CTGGTTTATCTACACGOT 

ATTTTTAATCAAGCAGGAATTCTTTAAAAAAGTGCAGGGATATTTCTTTAGTGAAAGGATAGACTGA 
4183.8 

ATGCAGGAATTTACAAATCCCTTTCCTATAGGCTCTAGTT^ 

GATGCTGGCAAATGGGATTTTGGCTCTGGGATGCAAACCTGTCATGGCAGATGATTCCCGTGAAGTTCTTGATTTT 
ACTAAGCAAAGTCAGGCTCTCTTCATCAATTTGGGGCATTTGTCAGCTGAGAAGGAAAAAGCAATCCGCATGGCA 
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GCTTCGTATGCAAA CCAAT CrrCTCTCCCGATGCrrAGTAGATGCGGTTGGCGTAACGACTTCATCCA 

GCTT AGTT AAAG ACC i'l'V 1' AG ACTATAG ACCTACGGTCCTTA AAGG AAACATGTCAG AAATTCG AAGTCTTGTTGG 

ATTAAAGCACCACGGCGTTGGGGTCGATGCGAGTGCTAAAGATCAAGAAACGGAGGATTTGCTTCAAGTCTTGAA 

AGACTGGTGTCAGACCTATCCTGGTATGTCri^'CTlAGTCACAGGTCCCAAGGACCTCGTCGTTTCGAAAAATCAG 

GTCGCTGTACTGGGAAATGGCTGTACTGAATTAGACTGGATAACAGGGACAGGAGACTTGGTTGGAGCCTTAACA 

GCTGl'l i I ICTCAGCCAAGGAAAGACTGGTi'l'iGAAGLi ICl-lGCTTAGCAGTCTCTTATCTCAATATCGCTGCTGA 

GAAAATAGTTGTTCAAGGAATGGGATTGGAAGAATTTCGTTACCAAGTACTCAATCAGCTTTCGCTCCTA^ 

AGATGAAAATTGGCTAGATACCATCAAAGGAGAGGTTTATGAATAG 



4185.3 

ATGAACCATAAAATCGCAATTTTATCAGATGTTCATGGCAATGCGACGGCGCTAGAAGCAGTGATTGCAGATGCT 

15 aaaaatcaaggggccagtgaatattggcttctgggagatal i i i lcl i cctggtccaggcgcaaatgacttagtcg 
ccctgctaaaggaccttcctatcacagcaagtgttcgaggcaattgggatgatcgtgtccttgaggcm 
gcaatatggcttagaagacccacaggaagttcagctcttgcgtatcacacagtatttgatggagcgaatggatcc 
tgcaacgattgtctggctacgaagcttgcctttgctggaaaagaaagaaattgacggattgcg ci ' ri 
cataatttacctgacaaaaactatggtggtgacttgctagttgagaatgatacacagaaatttgaccaactgct 

20 gatgcxjgaaacggacgtggcagtttatggtcatgtrcacaagcagttgcrrcgttatggaagtc 

ATCATCAATCCAGGGTCGATTGGCATGCCCTATTTTAATTGGGAGGCGTTAAAAAATCACCCTTCCCAGTATGCCG 
TGATAGAAGTTGAAGATGGGGAAT TACT CAATATCCAATTTCGTAAAGTTGCTTATGATTACGAAGCTGAGTTAGA 
ATTGGCCAAGTCCAAGGGGCTTCCCrrrATCGAAATGTATGAAGAACTGCGTCGTGACGATAACTATCAGGGGCA 
CAATCTGGAATTATTAGCCAGCTTAATAGAAAAGCATGGGTATGTAGAGGATGTGAAGAA i 1 1 i 11 I GA nTlTiG 
25 TAA 

/ 

4186 J 

ATGAATGTAAATCAGATTGTACGGATTATTCCTACTTTAAAAGCTAATAATAGAAAATrAAATGAAACAT^^ 
TTG AAACCCTTGG AATG AAGGCCrrGTTAGAAGAATCGG 
30 GCTGGTTTTAGAAGAAGCTCCCAGTATGCGTACTCGTAAGGTAGAGGGAAGAAAAAAACTAGCTAGATTGAT^ 
CAAGGTGGAAAATCCCTTAGAAj^GAAGGAATOT 

AAATGGCTACGC i "I ' 1 ' 1 G AAATTITCTC ACCAG AAG ATG ATTTG ATTTTG ATTCATGCGG AAG ATG ACAT AGCAAGT 
CTAGTAGAAGTAGGAGAAAAGCCTGAATTTCAAACAGATTTGGCATCAATTTCTTTAAGTAAAm 
TGGAATTACATCTCCCAACTGATATCGAAAGTTTCnTGGAATCATCTGAAATTGGGGCATCCCTTGATm 
35 AGCTCAGGGGCAGGATTTGACTGTGGACAATACCGTTACCTGGGACTTATCTATCCTCAAGTTCTTGGTCAATC 
TTAGACATAGCAAGTCTTCGCCAGAAGTrTGAGTCTACTGAATATTTTATTCCTAAGTCTGAAAA^ 
GTAAAGATAGAAATAATGTTGAATTGTGGTTTGAAGAAGTATGA 

4186.2 

40 AT GAAG TGGACCAAGATTATTAj^AAAAATAGAAGAACAAATCGAGGCAGGGATTTATCCCGGAGCCTCTTTTGCG 
TATTTTAAGGACAATCAATGGACAGAGTTCTATTTAGGCCAGAGTGACCCAGAGCATGGCnTGCAGACTGAGGC^ 
GGACTAGTTTATGACCTAGCTAGTGTCAGCAAGGTTGTTGGGGTTGGCACAGTTTGTACCTTCTTGTGGGAAATAG 
GTCAATTAGATATTGATAGACTGGTAATAGATrrrrrACCTGAGAGTGATTATCCAGACATCACTATTCGCCAGCT 
CTTGACTCATGCAACAGACCTTGATCCrnTA rrCCT AATCGTGATCTTT^ 

45 ATGTTTCA TCTCAA CAGACGAAGTCAGCCAGC Cl i ' l ' C I ' l ' l ATTCGGATGTCCA i i I ' iUU ^GCTGTTGGGCTTTATTTT 

GGAAAGAATTTTTAATCAAGATTTGGATGTGATTTTAAAGGATC^ 

TAAGTTTGGGCCAGTTGAGCTTGCTGTTCCAACAGTTAGAGGTGTAGAGGCAGGCATAGTGCATGATCCCAAGGC 
TCGTCTCCTGGGTAGACATGCTGGGAGTGCTGGTTTATT1TCGACTATAAA 
TATTrAGCAGATGATTTTGCAAGAGACTTAAATCAAAATTrTTCTCCTTTGGATG^ 
50 GGAATrrGG AAGG AGATTGGCTAGACCATACGGGCTATACAGGTACCTTTATCATGTGGAATCGTCAGAAGCAAG 
AAGCCACTATTTTCCTATCGAATCGTACCTATGAAAAGGACGAGAGAGCTCAATGGATATTAGACCGCAATCAAG 
TGATGAACTTGATTCGCAAAGAAGAGTAA 

4187.2 

55 ATGATGAAGAAGACTTATAATCATATTTTGGTCTGGGGAGTCATTTTCTATAGCATTTGCATTGTCT^ 

TACT CCTC AAGAACAATCTACCGTGGGAGTGGGAACTCCAGGTATTCAGCATCTTGGACGCCTGG I f i 1 ICI 1 1 f G 
ACTCCTTTCAATTCTCTCTGGAAACr^ 

TCCTCA ATGTCT T CTTGr rriTTCCTCTGATTTTCCAACTCC^ 
,^ GTCCr rcrr rri'AGl-riU-CrrGTGAGTCTTGGAATCGAGTGTACGCAATTAATCTTGGA Cl i 11 ICil I GATTTCAAT 

OU CGCGTCTTTGAGATTGATGATTTGTGGACCAACACTTTGGGTGGCTATCTGGCnTGGCTCC^ 

ATAAAAACAAGGTAAGGAATTAA 

65 
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4188.1 

ATGAAGATTCCTCTCTTAA Ll 1 H GCAAGGCATA.\A i 1 IGi I ' l ATGTCTTGCTTACnTGC'i'i 1 1 i L i t GCriTGGTT 
TATCGTGAT Oi i i i GATGACTTA fl ' i t ITl 1 i 1 GATATTCATGCGCCCGATCTAGCTA.AATTCGATGGACAAGCAAT 
TAAAAATGACTTATTAAAATCAGCATTAGATTTTCGTATTCTCCAGTTCAATCTAGGTT^ 

5 TTCCAATCATCATTGTTTTGCTAGGTTTTCAATATATTGACCTGAAAAATAAAGT^ 

AGAAGTGAGTTATCAAGGGTTAAAAAGAAAGTTGACTTTGCAAGTTGCAAGTATCCCTTGT^ 
ACTGTGCTGATAATTGCAATTATAACCTATr I'LT 1' i GGG AC I ' 1" 1 ' !" 1 CTCCTCTTGG ATGG AATTCTCTATTTTCTGAT 
GG AAGTGGTTTACAAAG ACTCCT AG ATGG AG AG AT AAAAAGCTA i ' fl GTTCTi T ACTTGTGTCCTACT AATCGGT A 
TTTTCATCAATGCAATCTAi i i M 1 ACAAATAGTTGATTATGTGGGGAATGTGACTCGTTCGGCAATCACCTATTTG 
10 ATGTTTCTTTGGCrTGGrrCTATGCTGCTTTATAGTGCCTTGCCTTACTATA^^ 
AGCTAGCTATGGGGATGTAAGTTTGATGAAACTCTTTACT 
AAAAATATGAAGATAATGTTTAA 

4188.2 

15 ATGAAGATAATGTTTAAGAATrTTAACAATATTTTGCTAAATAGAAAGATTGTTT^ 

GATGA n I 1 GATAAACCATCTATTGTCAACAGCGGTTCAAAAGCAGGATGCTGTTATCr i'r ri CA AGAGA G AATTG 
ATTTC AATrrTTTCCTATAATGACTATTCTG AAGCGAATTTAG AAATCCCCAAACTAT^ 1" 1' 1' 1 

CATGGTAGGATGGCTCTCTGTCATTrTACTTGAAAGTGATTTGGCAGACCATTACCATCACTTGATTCGCTATCi^ 
TCAAGCTCCITTTTCGATTATACAAGGAAACGATTGGTTGTCATrrCTAAAiU-rrinA 
20 GTTT<nTGGTTTACTTCCTCTAGGAATTCATrrCAAAACA<7rCGCAC^^ 

GTACTTACTACTGTCTTATCTGATAGCACTGATTAGTGCGGGCGCTGGTTTTTCCTTT^ 

tgtgggacaagaatggatgatggatcatattgtaacagtgtatttagtactcttaagtttattagttatgttga^ 
gttagtcgcttggaagagaaatttaagaaaggataa 

25 4188.5 

atgggcaXaggagagatgggcaaaggagttattggcttggagttcgactcagaagtattggtcaacaaggctcca 
acccttcaattggcaaatggtaaaacagcgactrrcctaacccagtatgatagcaagaccttgttgrrrcc 
ataaggaagatatcggacaggaaattattggtatagctaaaggaagcatcgaaagtatgcataatcttcctgtaa 
atctagcaggtgccagagttcctggcggagtaaatggtagcaaagcagcggtgcatgaagttccagaatttacag 
30 ggggagttaatggtacagagccagctgttcatgaaatcgcagagtataagggatctgattcgcttgtaactcttac 
tacaaaaaaagattatacttacaaagctcctcttgctcagcaggcacitcctgaaacaggaaacaaggagact 

CCTCCTAGCTTCACTAGGACTAACAGCl 'i iLl ICCTTGGTCTGTTTACGCTAGGGAAAAAGAGAGAACAATAA 
4188.10 

35 atgtttaaagrntacaaaaagttggaaaagcttttatgttacctatagctatacrrcctgcag 

ggggattggtggtgcactttcaaacccaaccacgatagcaacttatccaatactagacaatagtatttttcaatca 
atattccaagtaatgagctctgcaggagaggttgtattcagtaattrgtcactacttctctgtgtgggattatgta 
ttggcttagcgaaacgagataaaggaaccgctgcgttagcaggagtaactggttacttagttatgactgcaacga 
tcaaagctttggtaaaacl i 1 1 i atggcagaaggatctgcaattgatactggagttattgga gcatt agttgtcgg 

40 aatagttgccgtatatttgcacaaccgatataacaatattcaattaccttccgctttaggattctttg gaggt ^ 
cgcttcgttcctattgttacatcgttctcttctatcttgattggctttgtct^ 
caagttcttgtttctacaggtggatatattrctcaggcgggtccaattggaacttttctatatggat^^ 
actttctggagcagtaggcttacatcatataatttaccctatgttttggtatactgaacttgg 
ttgcaggacaaacagtggttggagctcaaaaaatal'rril'lgctcaattagccgatttggcccattctggattatt 

45 tacagaaggaacaaggttttttgcaggtcgtrrctcaacaatgatgttcggtttaccggct^ 

taccatagtgttcctaaaaatcgtcgtaaaaaatacgcgggtttgri-lu-riggagttgctttaacatt'i 1 1 1 attac 
cggtattacagaaccaattgaatttatgtttctattcgtcagtccggttctatatgttgttcacgcattcot 
gtgttagctt cttt attgcagacgtcttaaatattrcaataggaaacacattttc^ 
ttatttggaat tttgc aggggaacgcraagacgaattgggttcttcagattccatttgga 

50 gtattatattatttttagatggttcattactcaattcaacgttctaacgccagggcgaggagaagaagtagattct 
aaagaaatttctgaatccgcagattcaacttcaaatactgcagarratttaaaacaggatagcctacaaattatca 
gagcctrgggtggatcaaataatatagaagatgtagatgcttgtgtgacacgtttacgtgtagctgtaaaagaag 
ttaatcaagttgataaagcacttttaaaacaaattggtgcagttgatgtcttagaagtgaagggtggcat^^ 
aatct atgg agcaaaagc aatcttatataaaaatagt atta atg a aattttaggtgt ag atg att aa 

55 

4188. U 

ATGAAATTrAGAAAATTAGCTTGTACAGTACTTGCGGGTGCTGCGGrrCTTGGTCTTGCTGCTTGTGGC 

GCGGAAGTAAAGATGCTGCCAAATCAGGTGGTGACGGTGCCAAAACAGAAATCACTTGGTGGGCATTCCCAGTAT 

TTACCCAAGAAAAAACTGGTGACGGTGTTGGAACTTATGAAAAATCAATCATCGAAGCGTTTGAAAAAGCAAACC 

60 CAGATATAAAAGTGAAATrGGAAACCATCGACTTCAAGTCAGGTCCTGAAAAAATCACAACAGCCATCGAAGCAG 
GAACAGCTCCAGACGTACTCTTTGATGCACCAGGACGTATCATCCAATACGGTAAAAACGGTAAATTGGCTGAGT 
TGAATGACCTCTTCACAGATGAATTTGTTAAAGATGTCAACAATGAAAACATCGTACAAGCAAGTAAAGCTGGAG 
ACAAGGCTTATATGTATCCGATTAGTTCTGCCCCATTCTACATGGCAATGAACAAGAAAATGTTAGAAGATGCTGG 
AGTAGCAAACCTTGTAAAAGAAGGTTGGACAACTGATGATTTTGAAAAAGTATTGAAAGCACTTAAAGACAAGGG 

65 TTACACACCAGGrrCATTGTTCAGTTCTGGTCAAGGGGGAGACCAAGGAACACGTGCCTTTATCTCTAACCTTTAT 
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AGCGGTTCTGTAACAGATGAAAAAGTTAGCAAATATACAACTGATGATCCTAAATTCGTCAAAGGTCTTGA AAAA 

GCAACTAGCTGGATTAAAGACAATTTGATC^ATAATGGTTCACAATrrGACGGTGGGGCAGATAT^^ 

GCCAACGGTCAAACATCTTACACAATCCTTTGGGCACCAGCTCAAAATGGTATCCAAGCTA^ 

AGTAAGGTAGAAGTGGTAGAAGTACCATTCCCATCAGACGAAGGTAAGCCAGCrCTTGAGTACCTTGTAAACGGG 

rrTGCAGTATTCAACAATAAAGACGACAAGAAAGTCGCTGCATCTAAGAAATTCATCCAGTTTATCGCAGATGAC 

AAGGAGTGGGGACCTAAAGACGTAGTTCGTACAGGTGCITTCCCAGTCCGTACTTCATTTGGAAAAC^ 

ACAAACGC\TGGAAACAATCAGCGGCTGGACTCAATACTACTCACCATACTACAACACTATTGATGGATTTGC^ 

AAATGAGAACACTTTGGTTCCCAATCTTGCAATCTGTATCAAATGGTGACGAAAAACCAGCAGATGCTTTGA^ 

CCTTCACTGAAAAAGCGAACGAAACAATCAAAAAAGCTATGAAACAATAG 



4188.12 

ATGCAATCTACAGAAAAAAAACCATTAACAGCCTrrACTGTTATTTCAACAATCATTTTGCTCT^ 
TGTTCATCTTTCCATTCTACTGGATTTTGACAGGGGCATTCAAATCACAACCTGATACAATTGTTATT^ 
TGGTTCCCTAAAATGCCAACCATCGAAAACrrCCAACAACTCATGGTGCAGAACCCTGCCTTGCAATGGATGT^ 
15 AACTCAGTATrrATCTCATrCGTAACCATGTTCTTAGTTTGTGCAACCTCATCTCTAGCAGGTTATGTATTGG 
AAAACGTTTCTATGGTCAACGCATTCTATTTGCTATCrTTATCGCTGCTATGGCGCr^ 

TACCATTGGTACGTATCGTCAACTTCATGGGAATCCATCATACTCTCTGGGCAGTTATCTTGCCTTTGAT^^ 
GCCATTCGGTGTCTTCCTCATGAAACAGTTCAGTGAAAATATCCCTACAGAGTTGCTrGAATCAGCTAAAAT^ 
GGTTGTGGTGAGATTCGTACCTTCTGGAGTGTAGCCTTCCCGATTGTOAAACCAGGGTTTGCAGCCCTTGCAATCT 
20 TT ACCrrCATCAAT ACTTGG AATG ACT ACTTCATGCAATTGGTAATGTTG ACTTC At^ I ' ll O ACCATCTCA 

CTTGGGGTTGCGACCATGCAGGCTGAAATGGCAACCAACTATGGTTTGATTATGGCAGGAGCTGCCCTTGCTGCTG 
TTCCAATCGTCACAGTCTTCCTAGTCrrTCCAAAAATCCTTCACACAGGGTATTACTATGGGAGCGGTC 
A 

25 4191.1 

ATGAAAAAAACI'l I'l'i'ICriACTGGTGTTAGGCrrUlU'nGC CIl ' Cri CCACTCTCTG llU ' l ' i 'GCCATTGATTTCAAG 
ATAAACTCITATCAj^GGGGATTTGTATATTCATC 

ttgaggacgactttaagggccaaatcgtgggacrrggacgtgctggtaagatgcctagcgggtttgacatrgacc 
ctcatccaaagattcaggccgcgaaaaacggtgcagaactagcagatgtgactagcgaagtaacagaagaagcg 
30 gatggttat actgt g agag tctataatccaggtcaggagggcgacatagttgaacttgacctcgtctggaactta 
AAAAATTT AC i ■ I r i ccttt atgatg atatcgctg aatta aattggcaacctctg acag atagttcag agtct attg 

AAAAGTTTGAATTTCATGTAAGGGGAGACAAGGGGGCTGAAAAACTCI 1 i'iTCCATACAGGGAAACrn'l'IAGAG 

agggaacgattgaaaagagtaac cttga ttatactatccgtttagacaatcttccggctaagcgtgcagttgagtt 
gcatgcctattggcctcggaccgattttgctagcgctagggatcagggattgaaagggaatcgttragaagagttt 
35 aata agatagaagactcgattgttagagaaaaagatcagagtaaacaacrcgttacttgggtcctcccttcgatc 
ctttccatc7ccttgttattgagtgtctgcttctattttatttatagaagaaagaccact 

CAAAA ATC ATCGTCTCTATG AACCACCAATGG AATT AG AGCCTATGG I ' I ' I'l ATCAGAAGC AGTCT ACTCGACCTCC 
TTGGAGGAAGTGAGTCCCTrGGTCAAGGGAGCrGGAAAATTCACCTTTGATCAACTTATTCAAGCTACCTTGCTAG 
ATGTGATAGA CCGT GGGAATGTCTCTATCATTTCAGAAGGAGATGCAGTTGGTTTGAGGCTAGTAAAAGAAGATG 

40 GTTTCTCAAGCTTTGAGAAAGACTGCCTAAATCTAGLi i I I iCAGGTAAAAAAGAAGAAACTCTTTCCAA mGTT 

TGCGGATTACAAGGTA TCTGAT AGTCrrTATCGTAGAGCCAAAGTTTCTGATGAAAAACGGATTCAAGCAAGAGG 
GCirCAACTCAAATCi'lCrL'ri GAAG AGGTATTGAACCAGATGCAAGAAGGAGTGAGAAAACGAGTTTCCTTC^ 
GGGCrCCCAGATrATTATCGTCCTTTAACTGGTGGGGAAAAGGCCTTGCAACTGGGTATGGGTGCCTTGACTATCC 
TGCCCCT^^TTTATCGGATTTGG^ 

45 CirGGrmCTAGGGTTAGTTTTGTCTGTTTrCTATTArrGGAAGCTrCGACTAGATAATCGTGATGGTGTTCT 

TGAAGCGGGAGCTGAGGTCTACTATCTCTGGACCAGTTTTGAAAATATGTTGCGTGAGATTGCACGATTGGATCAG 
GCrGAACTGGAAAGTATTGTGGTCrGGAATCGCCTCTTGGTCrATGCGACCrTATTTGGCTATGCGGACAAGGTTA 
GTCA TTTG ATGAAGGTTCATCAGATTCAAGTGGAAAATCCAGATATCAATCTCTATGTAGCTTATGGCTGGCACAG 
TACGTTTTATCATTCAACAGCACAAATGAGCCATTATGCTAGTGTCCCAAATACAGCAAGCACCTACTCTGTATCT 

50 TCrGGAAGTGGAAGTTCTGGTGGTGGCTTCTCTGGAGGCGGAGGTGGCGGCAGTATCGGTGCCTTTTAA 

4191.2 

atg aaaaa agtaa g aaag atatttc ag a aggc agttgcagg actgtgctgt at atctc agttg acag c i ' fi ' 1 ' 1 c u 
cgatagttgctttagcagaaacgcc tgaa accagtccagcgataggaaaagtagtgattaaggagacaggcgaag 

55 gaggagcgcttcraggagatgcc gtctt tgagttgaaaaacaatacggatggcacaactgtttcgcaaaggacag 
aggcgcaaacaggagaagcgatarrrrcaaacataaaacctgggacatacaccttgacagaagcccaacctccag 
ttggttataaaccctctactaaacaatggactgttgaagttgagaagaatggtcggacgactgtccaaggtgaac 
aggtagaaaatcgagaagaggctctatctgaccagtatccacaaacagggacttatccagatgttcaaacacctt 
atcagattattaaggta gatg gttcggaaaaaaacggacagcacaaggcgttgaatccgaatccatatgaacgtg 

60 tgattccagaaggtacactttcaaagagaatttatcaagtgaataatttggatgataaccaatatggaatcgaatt 
gacggttagtgggaaaacagtgtatgaacaaaaagataagtctgtgccgctggatgtcgttatcttgctcgataa 

CTCAAATAGTATGAGTAACATTCGAAACAAGAATGCTCGACGTGCGGAAAGAGCTGGTGAGGCGACACGTTCTCT 
TATTGATAAAATTACATCTGATrCAGAAAATAGGGTAGCGCrTGTGACTTATGCTTCCACTATCTrrGATGGGACC 
GAGTTTAC AGTA GAAAAAGGGGTAGCAGATAAAAACGGaAAGCGATTGAATGATTCT L 1 1 i 1 1 I ggaattatgat 

65 cagacgagttttacaaccaataccaaagattatagttatttaaagctgactaatgataagaatgacattgtagaat 
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TAAAAAATAAGGTACCTACCGAGGCAGA.\GACCATGATGGAAATAGATTGATGTACCAATTCGGTGCCA Li i t lA 
CTCAGAAAGCTTTGATGAAGGCAGATGAGATTTTGACACAACAAGCGAGACAAAATAGTCAAAAAGTCATT^ 
ATATTACGGATGGTGTCCCAACTATGTCGTATCCGArrAATTTTAATCATGCTACGTTTGCTC^ 
CAACTAAATGCA I i 1 i 1 1 agtaaatctcctaataaagatggaatactattaagtgattttattacgcaagcaacta 
5 gtggagaacatacaattgtacgcggagatgggcaaagttaccagatgtttacagataagacagttratgaaaaag 
gtgctcctgcagctttcccagttaaacctgaaaaatattctgaaatgaaggcggctggttatgca 
tccaattaatggtggatatatttggcttaattggagagagagtattctggcttatccgtttaattct 

AAATTACCAATCATGGTGACCCTACAAGATGGTACTATAACGGGAATATTGCTCCTGATGGGTATGATCTCT^ 
GGTAGGTATTGGTATrAACGGAGATCCTGGTACGGATGAAGCAACGGCTACTAGrrrrATGCAAAGTATTTCTAGT 

10 AAACCTGAAAACTATACCAATGrrACTGACACGACAAAAATATTGGAACAGTTGAATCGTTATTTCCACA^^ 

GTAACTGAAAAGAAATCAATTGAGAATGGTACGATTACAGATCCGATGGGTGAGTTAATTGATTTGCAATTGGGC 
ACAGATGGAAGATTTGATCCAGCAGATTACACTTTAACTGCAAACGATGCrrAGTCGCTTGGAGAATGGAC^ 
GTAGGTGGTCO\CAAAATGATGGTGGTTTGTTAAAAAATGCAAAAGTGCTCTATGATACGACTGAGAAAA GGAT^ 
CGTGTAACAGG TCTG TACCTTGGAACGGATGAAAAAGTTACGTTGACCTACAATGTTCGTTTGAATGATGAGT^ 

15 TAAGCAATAAATTTTATGATACCAATGGTCGAACAACCTTACATCCTAAGGAAGTAGAACAGAACACAGTGCGCG 
ACTTCCCGATTCCTAAGATTCGTGATGTGCGGAAGTATCCAGAAATCACAATTTCAAAAGAGAAAAAACT^ 
ACATTGAGTTTATTAAGGTCAATAAAAATGATAAAAAACCACTGAGAGGTGCGGTCTTTAGTCTTCAAAAACAAC 
ATCCGGATTATCCAGATATTTATGGAGCTATTGATCAAAATGGCACTTATCAAAATGTGAGAACAGGTGAAGATG 
GTAAGTTGACCrmAAAAATCTGTCAGATGGGAAATATCGATTATTrGAAAATTCTGAACCAGCTGOT 

20 CGTTCAAAATAAGCCTATCGTTGCCrrCCAAATAGTAAATGGAGAAGTCAGAGATGTGACT^ 

GATATACCAGCGGGTTACGAGTTTACGAATGATAAGCACTATATTACCAATGAACCTATTCCTCCAAAGAGAGAA 
TATCCTCGAACTGGTGGTATCGGAATGTTGCCATTGTATCTGATAGGTTGCATGATGATGGGAGGAGTTCTATTAT 
ACACACGGAAACATCCGTAA 

25 

4191.3 

ATG AAATCAATCAACAAA I T III AACAATGCTTGCTGCCTTATTACrG ACAGCG AGTAGCCTGTTTTCAGCTGCAA 
CAGTTTTrGCGGCTGGGACGACAACAACATCTGTTACCGTTCATAAACTATTGGCAACAGATGGGGATATGGATA 

30 AAATTGCAAATGAGTTAGAAACAGGTAACTATGCTGGTAATAAAGTGGCTGTTCTACCTGCAAATGCAAAAGAAA 
TTGCCGGTGTTATGTTCGTTTGGACAAATACTAATAATGAAArrArrGATGAAAATGGCCAAACTCTAGGAGTGA^ 
TATTGATCCACAAACATTTAAACTCTCAGGGGCAATGCCGGCAACTGCAATGAAAAAATTAA<>GAAGC7rGAA 
AGCTAAATTTAACACGGCAAATTTACCAGCTGCTAAGTATAAAATTTATGAAATTCACAGTTT 
GGTGAAGATGGAGCAACCTTAACAGGTTCTAAAGCAGTTCCAATTGAAATTGAATTACCATTGAACGATGTTGTG 

35 GATGCGCATGTGTATCCAAAAAATACAGAAGCAAAGCCAAAAATTGATAAAGATTTCAAAGGTAAAGCAAATCCA 
GATACACCACGTGTAGATAAAGATACACCTGTGAACCACCAAGTTGGAGATGTTGTAGAGTACGAAATTGTTACA 
AAAATTCCAGCACTTGCTAATTATGCAACAGCAAACTGGAGCGATAGAATGACTGAAGGTTTGGCATTCA^ 
GGTACAGTGAAAGTAACTGTrGATGATGTTGCACTTGAAGCAGGTGATTATGCTCTAACAGAAGTAGCAACTGGTT 
TTG ATTTG AA ATTAACAG ATGCTGGTTTAG CTAAAGTG AATG ACCA AAACG CTG AAAAA ACTGTG A AAATC AC^ 

40 ATTCGGCAACATTGAATGACAAAGCAATTGTAGAAGTACCAGAATCTAATGATGTAACATTTAACTATGGTAATA 
ATCCAGATCACGGGAATACTCCAAAGCCGAATAAGCCAAATGAAAACGGCGATTTGACATTGACCAAGACATGGC 
TTGATGCTACAGGTGCACCAATTCCGGCTGGAGCTGAAGCAACGTTCGATTTGGTTAATGCTCAGACTGGTAAAGT 
TGTACAAACTGTAACTTTGACAACAGACAAAAATACAGTTACTGTTAACGGATTGGATAAAAATACAGAATATAA 
ATTCGTTGAACGTAGTATAAAAGGGTATTCAGCAGATTATCAAGAAATCACTACAGCTGGAGAAATTGCTGTCAA 

45 GAACTGGAAAGACGAAAATCCAAAACCACTTGATCCAACAGAGCCAAAAGTTGTTACATATGCTAAAAAGTTTG^ 
CAAAGTTAATGATAAAGATAATCGTTTAGCTGGGGCAGAATTTGTAATTGCAAATGCTGATAATGCTGGTCAAT^ 
TTAGCACGTAAAGCAGATAAAGTGAGTCAAGAAGAGAAGCAGTTGGTTGTTACAACAAAGGATGCTTTAGATAGA 

gcagttgctccttataaccctcttactgcacaacaacaaactcagcaagaaaaagagaaagttgacaaagct 
gctgcttataatgctgctgtgattgctgccaacaatgcatttgaatgggtggcagataaggacaatgaaaatgttg 
50 tgaaattagtttctgatgcacaaggtcgctttgaaatracaggccttcrrgcaggtacatattacttagaagaaac 
aaaacagcctgctggttatgcartactaactagccgtcagaaatttgaagtcactgcaacttcttarrcagcgact 
' ggacaaggcattgagtatactgctggttcaggtaaagatgacgctacaaaagtagtcaacaaaaaaatcactatc 
ccacaaacgggtggtattggtacaattatctttgctgtagcgggggctgcgattatgggtattgcagtgtacgcat 
atgttaaaaacaacaaagatgaggatcaacttgcttaa 

4191.4 

ATGACAATGCAGAAAATGCAGAAAATGATTAGTCGTATCT'rCl'l' i GTTATGGCTCTGTG' i - ri - ri CTCTTGTATGGGG 
TGCACATGCAGTCCAAGCGCAAGAAGATCACACGTTGGTCTTGCAATTGGAGAACTATCAGGAGGTGGTTAGTCA 
ATTGCCATCTCGTGATGGTCATCGGTTGCAAGTATGGAAGTTGGATGATTCGTATTCCTATGATGATCGGGTGCAA 

60 ATTGTAAG AG ACTTGCATTCGTGGG ATG AG AATA AA C T 11 L M C 1 ' I ' t CA AAAAGACTTCGTTTG AGATG ACCTTCC 

TTGAGAATCAGATTGAAG TATCTCA TATTCCAAATGGTCTTTACTATGTTCGCTCTATTATCCAGACGGATGCGGT 
TTCTTATCCAGCTGAAi r rCT 1 11 IGAAATGACAGATCAAACGGTAGAGCCTTTGGTCATTGTAGCGAAAAAAACA 
GATACAATGACAACAAAGGTGAAGCTGATAAAGGTGGATCaAGACCACAATCGCTTGGAGGGTGTCGGCTTTAAA 
TTGGTATCAGTAGCAAGAGATGTTTCTGAAAAAGAGGTTCCCrrGATTGGAGAATACCGTTACAGTTCrrCTGGTC 

65 AAGTAGGGAGAACTCTCTATACTGATAAAAATGGAGAGATTTTTGTGACAAATCTTCCTCTTGGGAACT^ 
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CAAGGAGGTGGAGCCACTGGCAGGCTATGCTGTTACGACGCTGCATACGGATGTCCAGCTGGTAGATCATCAGCT 
GGTGACGATTACGGTTGTCAATCAGAAATTACCACCTGGCAATGTrGACTTTATGAAGGTGGATGGTCGGACCAA 
TACCTCTCrrCAAGGGGCAATGTTCAAAGTCATGAAAGAAGAAAGCGGACACTATACTCCTGTTCTTCAAA^ 
AAGGAAGTAGTTGTAACATCAGGGAAAGATGGTCGTTTCCGAGTGGAAGGTCTAGAGTATGGGACATACTATTTA 
5 TGGGAGCTCCAJ\GCTCCAACTGGTTATGTTCAATTAACATCGCCTGTTTCCTTTACAATCGGGAAAG 

AGGAACTGGTAACAGTGGTTAAAAATAACAAGCGACCACGGATTGATGTGCCAGATACACGGGAAGAAACCCTTG 
TATATCTTG ATG CTTGTTGCCATTTTGTTGTTTGGT AG 

4191.5 

10 ATGAGCCACATATACTTATCTATTTTCACAAGTCTCTTGCTGATGCTAGGACrrGTCAATGT^ 

AATATTTACGCATCCGTATGGAAGCAGCATATGCTCCCTTTAACTGGACCCAGGATGATGATAGCAACGGAGCTG 
TCAAAATCG ATGG GACCAATCAGTATGCCAACGGATACGATGTTCAAATCGCCAAGAAAATCGCTAAGGACTTAG 
GTAAAGAACCTTTGGTTGrrAAAACCAAGTGGGAAGGTCTAGTCCCTGCCCTTACTrCTGGTAAGA 
TA TCGC^ GGTATGAGTCCAACTGCAGAACGCAAACAAGAAATTGCCTTTTCGAGCAGTTACTATACTAGCGAACC 

1 D AGTTTTGCTTGTCAAAAA AGAT TCTGCCTACGCAAGTGCTAAATCTTTGGATGACT^ 

TCTCAACAAGGGGTCTACCrrTATAACTTGATTGCACAAATCCCAGGTGCTAAAAAAGAAACAGCCATGGGAGA^ 
TTCGCTCAAATGCGACAAGCTCrrGAGGCTGGTGTCATTGATGCTTATGTTTCTGAACGTCCAGAAGC^ 
CTGAAGCTGCGAACTCTAAGTrCAAGATGATTCAAGTAGAACCTGGTrrCAAAACTGGGGAAGAAGATAC^ 
TCGCTATCGGGCTTCGTAAAAATGACAATCGTATTAGCCAAATCAATGCCAGCATTGAAACCATTTCAAAAGATG 

li) ACC AAGTT GCCTTGATGGATCGTATGATCAAGGAACAACCTGCCGAAGCTACAACAACTGAAGAGACTAGCAGTA 
GTTTCTTTAGCCAAGTTGCTAAAATTCTTTCTGAAAACTGGCAACAACTOT 

ATCrCTATCGTCGGAACCATCATAGGTCTCATrATTGGACTTGCCATTGGTGTCTTCCGTACTGCTCCTCTCTC^ 
AAACAAAGTCATTTACGGCCTACAAAAACTAGTCGGCTGGGTTCTCAATGTCTACATTGAAATm 
CCAATGATTGTTCAATCG ATGGTTATCTACTATGG AACTGCCCAAGCTTTCGGG ATCAACC 

25 TrGACCGTACACTGGCrGCrATCTTCATCGTTTCAATCAATACCGGTGCCTACATGACrGAAATCGTCCGTGGTGG 
TATCCTAGCAGTTGACAAGGGACAATTTGAAGCTGCGACTGCTCTTGGTATGACCCATAACCAGACCATGCGTAA 
GATTGTCCTACCTCAGGTAGTCCGTAACATCCTACCTGCAACTGGTAATGAATTTGTCATCAATATCAA^ 
TCTGTATTGAACGTTATCTCTGTTGTCGAACTTTATTTCTCAGGAAATACCGTGGCAACACA^ 
CCAGACATrrACAATCATCGCCGTGATTTACTTTGTCCTCACCTTCACCGTAACACGTATCCTACGC^ 

iU GCAGAATGGACATGGATACCTACACTACAGGTGCTAACCAAATGCAAACGGAGGATTTGAAATAA 

4191.6 

ATGACACAAGCAATCCTTGAAATTAAACACCTCAAAAAATCCTATGGACAAAACGAAGTGCTAAAAGACAT^ 
CTCACTGTCCACAAGGGAGAGGTCATCTCTATCATCGGAAGCTCTGGAAGCGGAAAATCGACCTTCCTACGCTCC 
ATTAACCTACTTGAAACACCAACTGATGGACAAATCCTTTATCATGGACAAAACGTCCTCGAAAAAGGCTATG^^ 
CTCACGCAATACCGTGAAAAGTTGGGGATGGTTTTCCAATCCTTTAACCTCrrrGAAAATCTCAATGT^ 
ACACAATCGTCGCTCAGACAACrGTCCTAAAACGCGAACGCACAGAAGCTGAAAAGATTGCCAAAGAAAACCTG 
GAAAAGGTCGGCATCGGAGAACGCTACTGGCAAGCCAAACCAAAACAACTCTCAGGTGGTCAAAAACAACGTGT 
GGCCATCGCTCGTGCCCTCTCCATGAATCCGGACGCTATTCrCTTTGATGAACCAACATCAGCTCTCGATCCAGAA 
4U ATGGTTGGAGAAGTCCTCAAAATCATGCAGGACCTGGCTCAGGAAGGCTTGACCATGATTGTCGTAACCCATGAA 
ATGGAATTTGCCCGTGATGTCTCTCACCGTGTTATCTTTATGGATAAGGGCCTGATCGCTGAAGAAGGTAAACCAG 
AAGACCTCTTCACCAATCCTAAAGAAGACCGAACAAAAGAGTTCCTTCAACGCTATCrCAAATAA 

4192.3 

45 ATGAAAAAGTATCAACTTCTATTCAAAATAAGTGCAGTCTTCTCnTACTTAr^^ 

TALi i 1 1 lATTGGAGTCATGATTGTGGTTCrrGTTAAGACAGGCCATGGTTATCTCTTCCGCArrCCAAGAAAAAA 
ATGGCTTTGGTATTCCATTrrGACAGTATTAGTGCTAGTGTTCCAGATCTCTTTTAACGTTCAGAC^^ 
TTCAGTCAACTGCGGAAGGTTGGGCTGTATTGATTGGTTATAGTGGGACTAACTTTGC^^ 
AGCCCTGTTCTTTCTGGTTCCACTGATGGAAGAATTGArrrATAGAGGATTACTGCAACATGCTTTC^ 
CGCGATrrGGTCTTGATrrGCrrCITCCTTCTATTTrATTTGCTCTCCCTCA 

ATATCTTCGTCTTTGCAACAGTTGGAATCATCTTTGCTGGTTTGACCCGCTATACCAAGAGCAm 
GCGGTGCATGTGATCAATAATATTGTAGCGACCrrCCCGTTrrrGCTCACrmCTACATAGGGTCTrGG^ 



50 



55 4193.1 

ATGAACAAGAAACAATGGCTAGGTCTTGGCCTAGTTGCAGTGGCAGCAGTTGGACTrGCTGCATGTGGTAACCGC 
TCTTCrCGTAACGCACCrrCATCTTCTGATGTGAAGACAAAAGCAGCAATCGTCACTGATACTGGTGGTCTrGATG 
ACAAATCATTCAACCAATCAGCrrGGGAAGGTTTGCAGGCrTGGGGT/^ 
GTTTCACrTACrrCCAATCAACAAGTGAAGCTGACTACGCTAACAAOT 

AATCTrCGGTGTTGGrrTTGCCCTTAATAATGCAGTTAAAGATGCAGCAAAAGAACACACTGAm 
03 AGTAGCTGGTGCTACAGGTGCAGGTGTCTTrGCAGAGGCAAAATCTCTCAACGAAAGCCGTCCrCAAAATGAAAA 



60 
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AGTTTGGCrrTATCGCrrGTTGATCGTGACCAAGAAGCACAAGGTAAATACACTTCTAAAGATGGCAAAGAATCAAA 
Cl'HGl ICl lGTATCrACTTTGAAACAAGTTGGTACAACTGTAAAAGATATTTCTAACAAGGCAGAAAGAGGAGAA 
TTCCCTGGCGGTCAAGTGATCGTTTACTCATTGAAGGATAAAGGGGTTGACrrGGCAGTAACAAACCm 
AAGGTAAAAAAGCTGTCGAAGATGCAAAAGCTAAAATCCTTGATCGAAGCGTAAAAGT 
5 TCCTGAAAAATAA 

4193.3 

ATGTCTAAAAAATTACAACAAATTTCGGTTCCCTTGATTTCTGTATTCCTAGGAATTT^ 
GTGGATCTTCGGTTATGATGCTATTTGGGGCTACGAAGAATTGTTCTATACAGCCTTTGGCAGTCrGCGTGG 

10 GGAGAAATCTTCCGTGCTATGGGTCCTCrGGTCTTGATTGGTCTTGGTT^ 

TAACGTCGGACTTCCTGGTCAGGCTTTGGCAGGTTGGATTCTCAGTGGTTGGTTTGCCCTGTCGCATCCAGAT^ 
CCCCGTCCCTTGATGATTCTAGCAACCATCGTGATTCCCTTGATTGCTGGTGGGATTGTCGGAGCGATTCCAGG^ 
TTC TTAG GGCCr AT CTAG GGACGTCAGAGGTTATTGTAACCATCATGATGAACTACATTGTCTTGTATGTAGGGAA 
TGCCTTTATCCATGCTTTCCCTAAAGACTTCATGCAAAGTACAGATTCGACCATT^ 

15 TATCAG A CACC TTGGTTGGCTGAGTTG ACTGGTAACTCACGG ATG AATATTGGTA i i ' 14 C i ' 1 ' i GCCATCATTGCCGT 

TGCAGTTATTTGGTTCATGCTCAAGAAAACAACTCTTGGTTTTGAAATCCGTGCAGTTGGTCTTy^ 
CAGAATATGCTGGTATTTCTGCCAAGCGGACTATTATCCTATCTATGATTATTTCAGGTGCCrrGGCAGG^ 
GGAGCTGTTGAAGGTTTGGGAACCTTCCAGAACGTCTATGTTCAAGGTTCGTCATTAGCT 
TGGCGGTTAGTTTGCTTGCGGCCAACTCACCAATTGGTATACTCTTTGCAGCCrTCCTAT^ 

20 GGGGCTCCTGGTATGAATGCGGCG CAGG TACCATCTGAGCTrGTCAGCATTGTAACAGCGTCTATTAT CI ' I ' Crri 'G 
TCAGTGTTCATTACCTTATCGAACGCmGTCAAACCGAAAAAACAAGTTAAAGGAGGTAAGTAA 

4194.1 

ATGGGACTGAAAAAGAAACTAAAGTTGACTAGTTTGCTAGGACTGTCTCTGTTAATCATCACAGCCT^ 
25 AATGGGpTAACTAGCGATATTACAGCCGAATCGGCTGATT^ 
TTCGCTTnTATCGTTTGATATTAGT 

CCAGTCTTTCAGGTGCAAATGGTGGCTTCTAGGAAAATGCAGGAAGCTCAGCCACGCATTAAGGCGCTTCGAG 
CAATATCCAGGTCGAGATA TGGA AAGCAGAACCAAACTAGAGCAGGAAATGCGTAAAGTATTTAAAGAAATGGG 
TCTCAGACAGTCAGACTCTCnTrGGCCGATTr^ 
30 GAGTTGACTTTTTAAAGACAGGTCATTTCTTATGGATTAACC^ 

TTAGCAGCAGTATTCACCriTrrAAGTACTTGGTTGTCCAACAAAGCrrTGTCTGAGCGA^ 

CGATGATGTATGGGATTCCACTCTTGATTTTTATCTTTGCAGTTTATGCGCCACGTGGAGTCGCCCTATACTGG 

AGTGTCTAATGCrrATCAAGTCTTGCAAACCTATTTCTTGAATAATCCATTCAAGAT^ 

GTAGTACAGGCACAAAAAGATTTGGAAAATAGAAAAAGAAAAGCCAAGAAAAAGGCTCAGAAAACGAAATAA 
4194,4 

ATGGTTATCGATCCATTTGCTATCAACGAACTAGACTATTACTTAGTTTCACACTTCCACAGTGAT^ 

CATACACAGCTGCAGCAATTCTCAATAATCCTAAGTTAGAGCATGTTAAGTTTATCGGTCCTTACCACTGTGGACG 

AATCTGGGAAGGATGGGGTGTTCCAAAAGAACGTATCATCGTTGTTAAACCAGGTGACACTATCGAATTAAAAGA 

40 TATGAAGATTCATGCAGTAGAATCATTTGACCGTACTTGCTTGGTAACTCTCCCAGTGAACGGTGCTGATGAGACA 
GGCGGTGAACTTGCTGGCTTGGCTGTTACAGATGAAGAAATGGCTCAAAAGGCTGTTAACTATATCT^ 
CAGGTGGAACCATCTATCATGGTGCAGATrCTCACTTCrCAAACTATTTTGCAAAACATGGT^ 
TGATGTTGCTTTGAATAACTATGGTGAAAATCCGGTAGGTATCCAAGACAAAATGACATCTATCGACCTTCTTCGT 
ATGGCAGAAAATCT GCGT ACCAAAGTCATTATCCCAGTTCACTATGATATCTGGTCTAACTTCATGGCTTCT^ 

45 ATGAGATTCTAGAACTTTGGAAAATGCGAAAAGATCGCTTGCAATACGATTTCCATCCATTT^ 

CGGTAAGTACACTTATCCTCAAGATCAACACTTAGTAGAATACCATCATCCACGTGGTTTTGATGATTGTrTTG 
CAAGACTCTAACATTCAATTTAAAGCTTTGCTATAA 

50 

4196.2 

ATG TrCCTT TCAGG CTGGT TGTCTAGTTTTGCTAATACTTATATCCATGATrrACTGGG 
TCCATTpTAAATGCC^ 

00 TTGTTTTGGCTTTGATTCCTGTGCGAAAATTAAAATCTTTGTITT^ 

ATGATTAAGGATATTGGTTATATTCGTACGGATTTGCCAGAGGGCTTTGACTTTACTATT^ 
GTATCATCTCAGGAATTGCCTCTCACTGGACTTTTTCAGGTCTAGCTGTAGTAGGTGTrrACTTGC^ 
TATAAA GGACA GAAGGTrGGCAAGAAACAGGGCCTTATTTTTCTAGGTTTAGCCTTGGGAACT 
ACTCTCCTTTTGTGGAGpGGAAACAGAGTTCCCTT^ 

OU tatcatgcttattgctttgttgagaaacacaatgagttgatgacctag 

4197.1 

atgaaggtggaaccacgttgcgacgtcctttcgaggatgtcgcatttttttattaggat^ 

AAGAATTAGTGGAGCGCAGTTGGCCAATCCGACAAGCTTATCACGAACTGGAAGTTAAGCATCATGATtCCAACT 
00 GGACGGTAGAAGAAGACCTCTTGGCrrrATCTAATGATATTGGAAATTTCCAACGACTCGTGATGAC^ 
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GACGCTACTATGATGAAACACCCTACACACTGGAACAAAAACrTTCAGAAAATATCTGGTGGCTATTAGAAC^^ 
CTCAACGTTTGGATATAGACATTCTGACGGAAATGGAAAACTTCCTCTCTGATAAAGAAAAGCAATTGAACGTTA 
GGACTTGGAAGTAG 

5 4197.4 

ATGCTTGATTGGAAACAAT 1 I i n CTAGCCTATCTGCGCTCCCGTAGTCGT Cl i I i 1 ATCTATCTGCn iLl i i GGC 
Arri ' LTl GTCTTACTCTTTCA Gn 1 1 lA TTTGCCAGTCTAGGAATTTACTTCCTCTA Ll ' i I 1 iLi i L I i GTGTTGCTTT 
GTAACCATATTAi'l i 1 1 CACTTGGGACATATTGGTGGAAACGCAGGTCTATCGCCAGGAACTTCTCTATGGAGAGA 
GGG AAGCCAAGTCTCCTTTGGAAATAGCTTTAGCAGAAAAATTAGAAGCGCGTGAGATGGAuACT 

10 GGTCAAAAGCAGAAAGAAAACTGACGGArrrGCTGGATTACTATACCTTGTGGGTCCATCAGATAAAGA CCCC CA 
TTGCACCCAGTCAACTCTTAGTTGCAGAAGTGGTCGACCGCCAACTGAAGCAGCAGCTAGAACAGGAAATTT^ 
AAATCGACTCCTATACCAACCTAGTTTTACAGTACCTGCGTTTAGAAAGTTTCCATGATGATTTGCT 
GGTTCAAATTGAGGACTTGGTCAAGGAAATAATTCGTAAATATGCrCl'1'1"! L i ri ATTCAAAAAGGCTTAAATGTC 
AATCTACATGACCTTGATAAAGAAATCGTGACGGATAAAAAGTGGCTGCTAGTGGTTATTGAGCAAATCATCTCA 

15 AACAGTCTCAAGTACACCAAGGAAGGTGGTCTGGAGATTTATATGGATGACCAAGAGCTTTGTATCAAAGATACG 
GGAATCGGGATAAAAAACAGTGATGTCCTCCGAGTATrrGAACGTGGCTTTTCAGGATACAATGGCCGT^ 
AGCAGTCCTCTGGACTTGGCCTTTATCTATCTAAGAAAATTTCTGAAGAACTGGGGCACCAGATTCGTATC 
TGAGGTCGGAAAAGGAACGACAGTGCGGATTCAGTTTGCTCAAGTGAACTTAGTCCTTGAGTAA 

20 4211.2 

ATGGAACrrAATACACACAATGCTGAAATCTTGCTCAGTGCAGCTAATAAGTCCCACTATCCGCAGGATG/^ 
CCAGAGATTGCCCTAGCAGGGCGTTCAAATGTTGCTAAATCCAGCTTTATCAACACTATGTTGAACCGTA AGAA TC 
TCGCCCGTACATCAGGAAAACCTGGTAAAACCCAGCTCCTGAACTT T I "i i AACATTGATGACAAGATGCGCTTTGT 
GGATGTGCCTGGTTATGGCTATGCTCGTGTTTCTAAAAAGGAACGTGAAAAGTGGGGGTGCATGATTGAGGAGTA 
25 CTTAACGACTCGGGAAAATCTCCGTGCGGTTGTCAGTCTAGTTGACCTTCGTCATGACCCGTCAGCAGATGATGTG 
CAGATGTACGAATTTCTCAAGTATTATGAGATTCCAGTCATCATTGTGGCGACCAAGGCGGACAAGATTC CTCGTG 
GTAAATGGAACAAGCATGAATCAGCAATCAAAAAGAAATTAAACTTTGACCCGAGTGACGATTTCATCCTCri 1 TC 
ATCTGTCAGTAAGGCAGGGATGGATGAGGCTTGGGATGCAATCTTAGAAAAATTGTGA 

30 4211.3 

ATGACAAAGAAACAACTTCACTTGGTGATTGTGACAGGGATGAGTGGCGCAGGGAAAj^CTGTAGCCATTCAGT^ 

TTCGAGGATCTAGGTTATTTCACCATTGATAATATGCCGCCAGCTCTCTTGCCTAAGriTTTGCAG 

TAAGG AAGACAATCCTAAGTTGGCCTTGGTAGTGGATATGCGTAGCC G 1 ' I ' C 1 1 - 1 CI IT 1 cagagattcaagctgtt 

TTGG ATGAGTTGGAAAATCAAGATGGTTTGGATTTCAAAATCCT C 1 T i T 1 GGATGCGGCTG ATAAGG AATTGGTCG 

35 CTCGTTACAAGGAAACCAGACGGAGTCACCCACTAGCAGCAGACGGTCGTATTTTAGATGGAATCAAGTTGGAAC 
GTGAACrCTTGGCACCTTTGAAAAATATGAGCCAAAATGTGGTGGATACGACTGAACTCACTCCACGTGAGcTGCG 
CAAAACCCTTGCAGAGCAGTTTrCAGACCAAGAACAAGCCCAGTCTrrCCGTATCGAAGTCATGTCTTTC 
AAGTATGGAATCCCGATTGATGCGGACTTGGTCrrrGATGTCCG I i iLl 1 GCCAAATCCCTATTATT TACCA GAACT 
GAGAAACCAAACGGGTGTGGATGAACCTGTTTATGATTATGTCATGAACCATCCTGAGTCAGAAGACTTTTATCAA 

40 CATTTATTGGCCTTGATTGAGCCGATTCTGCCAAGTTACCAAAAGGAAGGTAAGTCCGTTTTGAC^^ 

_ GATGTACGGGTGGACAACACCGTAGTGTGGCATTTGCTAAACGCTTGGCGCAGCACTTATCCAAGAATTGGTCTGT 
TAATGAAGGGCATCGCGACAAAGACCGCAGAAAGGAAACGGTAAACCGTTCATGA 

421 L4 

45 ATGAGAAAACCAAAGATAACGGTGATTGGTGGAGGGACTGGAAGTCCCGTCATTCTAAAAAGTCTGCGGGAAAAA 
GATGTGGAAATCGCAGCTATCGTGACGCTGGCAGATGATGGTGGTTCTTCAGGTGAACTCCGAAAAAATATGCAA 
CAGTTGACACCGCCAGGTGATCTTCGTAATGTCCTTGTGGCCATGTCGGATATGCCTAAGTTTTATGAGAAGGTCT 
TTCAGTATCGGTTCTCTGAGGATGCCGGAGCCrrrTGCTGGCCATCCATTGGGAAATCTCATCATTGCTGGCTTGTC 
AG AAATGCAGGGTTC AACCTAT AATGCC ATGC AGTT ATTG AGO AA A n T 1 ' 1 CCATAC A AC AGGG AAAATTT ATCCT 

50 TCCACTGACCATCCITrGACCCTrCATGCAGTCTTTCAGGATGGGACAGAAGTGGCTGGAGAGAGTCATATTGTAG 

accatcgaggcataattgacaatgtctatgtgaccaatgccctaaacgatgatacgcctctggccagccgtcgag 
tagtgcagaccatccttgaaagtgacatgattgtcctagggccaggttccctctttacctctattttgcccaatat 
cgtgattaaggaaattgggcgggctcl 1 riggaaaccaacgcagaaattgcctatgtctgcaatatcatgaccca 
acgtgggcagacggaacactttacagatagcgaccacgtggaactcttgcatcgtcaccttggtcgcccl 1 i i'atc 
55 gacactgtcttggtgaatattgaaaaagtgcctcaggaatacatgaattccaaccgttttgatgaatac^ 
aagtggaacacgattttgtaggtcrrrgtaagcaagtttcgcgcgtgatttcatctaacttccttc 
tggcggtgccttccacgatggagatttgattgtggacgagttcatgcgcattatacaggtgaaaaaatga 

4213.1 

60 ATGAAAAATTTGATAAAGTTGCTAATAATTAGATTGATTGTTAACTTAGCAGACAGTGTATTTTATA^ 
TGTGGCACCTTj\GCAATAATTATTCTTCGAGCATGTTCTTA 
TTACTAATCTTTTTTGGACCAGTTATTGACAGAGTAAATCC 

AGCAGTGGCTGTAATATTTTTATTATTATTAAACCAAATATCATmGGGTGATAATGAGTCTAGTGT^ 

taatggctagctccataagttacgtgatagaagatgtgttgattcctcaagtggtagaatatgataagattgtatt 

65 TGCAAATTCTCriTTTAGTATTTCGTATAAAGTATTAGATTCTA Tl 1 1 i AATTCATTCGCATCATTTTTACAGGTGG 
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CAGTAGGATTTATTTTATTGGTTAAGATAGATATAGGCATATTTTTACTTGCT 

TTTAGAACTAGCAATGCGAATATAGAAAACTTCTCTTTCAAATATTACAAGAGAGA AGTGTTG CA^ 
TTTATTrrAAATAATAAATTATTATITAAAACCAGTATTrCTT^ 

GTAGrrGTACCGATTTTrrCTATTCGATATTTTGATCGTCCGAl i 1 i i i ATGGTA'n-i'n'i'TAACTATTGCTGGTTTG 
5 GGTGGTATATTGCGAAATATGCTAGCGCCAATCGTAATAAAATATTTAAA ATCGA ATCAAA TTGTTGGTGTA TTTC 
TTTTTTTGAACGGCTCAAGTTGGTTAGTAGCAATTGTTATAAAAGACrrATACTT^ 

GTTTTATGTCTAAAGG ACTCrrC AAT ATTA" r r 1 ' i ' I AATTCGTTGT ACC AAC AAAT ACCTCCACATCAAC in^ii GGT 
AGGGTAAATACTACCATrGATTCTATTA TriCl i 1 r GGAATGCCAATTGGTAGTTrAGT TGCAG GAACGCTTATTGA 
TTTGAATATTGAATTAGTGTrAATTGCTATTAGCATACCTTAl I U 1 IGil 1 iLl iATAmi I i ATACGGATAATGG 
1 0 ATTG AAAG AATTTAGTATATATTAG 

4213.2 

ATGATGTCTAACAAAAATAAGGAAATTCTGATTTTTGCGATTCrrCTATACAGTCCTCTTTATGT^ 
ATTGCTGG ClU ' L ' lUU AATGCCATCTGCCATTGCAAATTATCITGTTTATGTAGTTTT 

15 GTTCAAGGATAGATTGATCCAACAATGGAAGGAGATTAGAAAGACTAAAAGAAAA'i'iViUC I'll GGACTCTT AAC 

AGGATGGCTOTTCTCATTCrGATGACTGTrGTCTTTGAATTTGTATCAGAGATGTTGAAGCAC^^ 
ATGGACAAGGTCTAAATCAGTCTAATATTCAAAGTACCTTTCAAGAACAACCACTACTGATAGCTGi'ri riGCTTG 
TGTCATTGGACCTCTGGTAGAAGAATTA rn 1 I CCGTCAGGTCTTATTGCATTACTTGCAGGAACGGTTGTCAGGTT 
TACTAAGCATTATTCTGGTAGGACTTG i ' lTl I GCTCTGACTCATATGCACAGTTTGGCTCTATCAGAGTGGATTGGT 

20 GCAGTTGGTTACTTAGGTGGAGGCCTTGCCTTTTCTATTATTTATGTGAAAGAAAAAGAGA^ 

TACTTGTrCACATGTTAAGCAACAGCCTCTCCTTAATCATTTTAGCTATCAGTATAGTAAAATGA 



25 

4224,1 

TTGAAAAAGCCAATTATCGAATTCAAAAACGTCTCTAAAGT T I T i GAAGACAGCAACACCAAGGTTCTCAAAGAC 

ATCAACTTTGAGTTGGAAGAAGGGAAATTCTACACCCTTCTAGGTGCATCTGGTTCGGGGAAATCAACTATCCT 

ACATTATTGCAGCTrrACTGGATGCGACGACAGGAGATATCATGCTAGACGGTGTTCGTATCAATGATATTCCAAC 

30 CAACAAGCGCGACGTACATACCGTCTTCCAATCCTATGCCTTGTTCCCACATATGAATGTGTTTGAAAATGTTGCC 
TTTCCACTTCGCTTGCGTAAAATTGATAAGAAAGAAATCGAGCAGCGTGTAGCGGAAGTTCTCAAGATGGrrCAG^ 
TGGAACGTTATGAAAAACGTTCCATCCGCAAACrrrCTGGAGGACAACGTCAGCGTGTGGCCATCGCCCGTGCTA 
TCATCAACCAACCCCGTGTGGTCTTGTTGGACGAGCCTTTATCAGCGCTGGACTTGAAATTGAGAACAGACATGCA 
GTACGAATTGCGTGAATTACAACAACGATTGGGCATTACCTTTCTCrrTGTCACTCACGATCAGGAAGAAGCT 

35 GCCATGAGTGACTGGATTTTCGTTATGAATGATGGCGAGATTGTCCAGTCTGGAACCCCTGTGGACATCTACGATG 
AGCCAATCAACCACTTTGTTGCCACCTTTATCGGGGAGTCAAACATCTTGCCAGGTACCATGATTGAGGACTACT^ 
GGTCGAATTTAACGGCAAACGCTTTGAAGCGGTTGATGGTGGGATGAAGCCAAATGAACCTGTTGAG GTCGT TAT 
TCGTCCAGAGGACrrGCGCATTACCCTTCCTGAAGAAGGCAAGCTCCAAGTTAAGGTCGATACCCAGCTTTTCCGT 
GGAGTTCATTATGAAATTATCGCCTATGACGAACTTGGAAATGAATGGATGATCCACTCAACCCGTAAGGCTATCG 

40 TGGGTGAGGAAATCGGTCTGGACTTTGAACCAGAAGACATCCACATCATGCGTCTCAATGAAACCGAAGAAGAGT 
TCGATGCTCGTATTGAGGAGTACGTAGAAATCGAAGAGCAAGAAGCAGGTTTGATCAATGCAATCGAGGAGGAAA 
G AG ATG AAG AA AACAAG CTCT AA 

4252.1 

45 ATGAAATCAATGAGAATCrrTATTTrrGTTAGCTn*AATTCAAj\TCACTT^ 

GCATCTTGTCCTTTAAACAAAGTACAGCl 1 i 11 ICATCGGAAGCATGGTTTTCGTTTCAGGAATCTGT GCTG GAGT 
AAATTATCTTTATACCCCTAAGCAAGAAGTCCATAGTGTCCTAGCCAGTAAGAAGTCGGTGAAGCri'l'i 1 1 ACAGT 
ATGTTACTCTTAATTAATTTGTTAGGAGCTGTTCTTGTTTTGTCAGATAACTTGTTCATC^ 

AGAATTAGTTGA ClTn l ATTGCCATCC n L I I 1 1 ' i CCT ATTTGGGCTAGATTTGCTG A nTn T i ACCCTTG AAAAA 
50 ATACGTGCGCGAl l ri C 1 1 GCTATGCTGGACAGAAAAAAGACAGTGTTGGTGACTATTTTAGCAACACri'Crri"! C 

TTAAGAAATCCAATGACCATTGTCTCACTTCTGATTTATATTGGACTGGGCTTG U ' lUn ' lU GCA 
AAATTCGGTTAAGAAGGAAGTTTCC !■ I'l' 1 ATGGTCATATTTTCCGAGATCTTGTATTGGTCATTGTTACGCTCATTT 
TCTTTTAG 

55 4252.2 

ATGGTTAAAAAAATTATTGGAATGGTGCTAGCTTTACTTTCTGTAACTGTAGTAGGAGTAGGTG 11 ' ^ I ' l GCTTATAC 
TATTTATCAACAAGGGACAGAAACCTTAGCTAAAACCTATAAAAAAATCGGTGAAGAAACCAAGGTTATTGAAGC 
GACTGAACCTCTAACCATTCTGTTAATGGGAGTGGaCACCGGAAATGTTGAACGAACTGAAACTTGGGTCGGTAG 
AAGTGATAGCATGATCTTGATGACAGTGAATCCTAAAACGAAAAAAACAACAATGATGAGTTTAGAGCGGGATAT 

60 TCTGACGCGCATTGAATCAGGGAATGGTCAGGCTCATGAAGCGAAACTGAACTCAGCATATGCAGATGGTGGAGC 
AGAGCTTGCTATAGAAACCATTCAAAAAATGATGAATATCCATATTGATCGCTATGTGATGGTCAATATGAGAGG 
ATTGCAAAAACTAGTGGATGCAGTAGGAGGTATTACAGTCAATAATATCCTAGGTTTCCCAATTTCTATCAGTGAC 
CAAGAAGAATTTAATACTATTTCTATCGGTGTTGGGGAGCAACATATTGGGGGAGAAGAAGCCCTAGTCTATGCA 
CGAATGCGTTACCAAGATCCTGAGGGGGATTATGGTCGTCAAAAACGTCAACGTGAAGTTATTCAAAAAGTCATG 

65 GAAAAAGCTCTCAGTTTAAATAGCATTGGTCATTATCAAGAGATTCTAAAAGCTTTGAGTGACAATATGCAGACC 
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AATATTGATTTGTCTGCAAAAACn'ATCCCTA.^CrrGCTAGGCTATAAAGATTCATrrA^^ 

AGTTGCAGGGTGAAGGAGAGATACTTCA.\GGTG i 1 tui 1 ACCAGATTGTTTCGAGAGCA CATA TGTTGGAAATGCA 

AAATCTACTCCGACGTTCTTTGGGACAAGAAGAACTTACrCAGCTTGAAACCAAT^ 

TTrGGCAGAGCACCTGTTGGTGATGAAGATAATTAA 

4256.2 

ATGAAAAAACAAGCCTATGTCATTATTGCTCTCACCTCCTTCCTATTTGTCrn^^ 

AATACTTGATTTTGACTGGTCTAT Cl i 1 l - i GCACGATGTCGAAAAAACAGAAAAATTrGTCri-rrt ATTGTrGGTTT 

TCAGCATGTCCATGACCTGTCTCTTAGCCCTGTTTrGGCGAGGGATCGAAGAGCTTTCTCTAAGAAA^ 

TAATCTCAAGCGTTTATT AGCAG GGCAAGAAGTGGTTCAGGTTGCAGATCCAGATTTGGATGCCAGTTTCA^ 

TTATCAGGTAAACTTAACCTTTTGACAGAGGCTCTTCAAAAAGCTGAAAATCAGAGCCTTGCT 

ATCATCGAGAAGGAACGGAAGCGAATTGCTCGGGATTTGCACGATACAGTCAGTCAGGAGTTGTTTGCGGCCCAC 

ATGATTrTATCGGGTATCAGTCAGCAGGCTTTGAAATTGGATAGAGAAAAGATCCAGACCCAGTTGCAGAGTGTC 

ACAGCTATTTTAGAAACAGCCCAGAAGGATTTGCGGGTTTTGCTCTTGCATTTGCGACCAGT^ 

AGAGCTTGATAGAAGGGATTCAAATTCTTTTAAAAGAGCTTGAGGACAAGAGTGATCTTAGGGTTAGTCT 

AGAATATGACGAAATTGCCTAAGAAAATCGAGGAGCATATCTTCCGTATCCTGCAAGAGTTGATTAGCAATACCC 

TCCGCCATGCCCAGGCATCTTGCCTAGATGTCTACCTCTATCAGACAGATGTTGAATTGCAACTGAAGGTGGTGGA 

CAATGGGATTGGTTrCCAGTTAGGGAGCTTAGACGACTTGAGTTATGGACTGCGAAATATCAAGGAGCGGGTTGA 

AGATATGGCTGGAACAGrrCAACTCTTGACAGCTCCCAAGCAAGGGCTGGCGGTTGATATCCGTATTCCCCTGrTA 

GATAAGGAATGA 

4263.1 

ATGATTGTTTCCATTATTTCTCAAGGATTTGTCTGGGCTATTCTAGGTCTGGGAATCTTTATGACAT^ 

AAAarrCCAGATATGACGACAGAAGGTTCCrrCCCTCTTGGGGGAGCTGTTGCTGTCACTT^ 

GTGAACCCATTTTTAGCGACACTTGTTGCTGTAGGAGCAGGTTGTTTGGCTGGAATGGCAGCAGGCC^ 

CAAAAGGGAAGATCCCAACCTTGCTCTCAGGGATrrTGGTGATGACTTCTTGTCACTCA^ 

GGGACGTGCGAATTTAGGCCTGCTTGGAACCAAGCAAATTCAGGATCm 

CTCTTGACAGGTCTCATCTTTGTGAGTArrGTTATTGCTCTCATGCTCTTTTTCTTGG 

CTATATTGCTACAGGGGATAATCCTGATATGGCTAGAAGTTTCGGGATTCATACTGGACGCATGGAGCTCATGGGC 

TTGGTCTTATCAAATGGTGTGATTGCCCTTGCAGGTGCCCrCATTGCTCAGCAAGAAGGTTATGCCGATCTGTCT^ 

GAGGGATCGGGGrrATCGTTGTGGGGCTTGCAAGTTTGATTATrGGAGAAGTTATTn'CAAGAGTTTGAGOT 

AGAGCGrrTGGTTACTATCGTTGTAGGTTCTATCGCTTATCAATTTTTAGTGTGGGCAGTTATCGC^ 

ATACAAGTTACCTTCGTTTATACAGTGCCTTGATTTTAGCAGTCTGCCTCATGATTCCAACATT^ 

CTTGAAAGGAGCCAAGTTAAGCAAATGA 

4346.1 

ATGAAAAAAATGAAAGTTTGGTCTACTGTACT^GC^^CGGGAGTTGCTCTTACTACACTTGCTGC^ 

GTTC AAATTCTACG ACTG CTTCTTCATCTG AAG AAAAAGCTG AT AAAAGTC AAGAATT AGTTATCTATTCC AACTC 

AGTCTCAAATGGTCGTGGTGATTGGTTAACTGCTAAAGCAAAAGAAGCTGCTTTTAATATAAAAATGGTT 

GCTGGCGCTCAATTAGCAGACCGTCTTATTGCTGAGAAGAATAATGCAGTTGCAGATATGGTArrrGGAATTGGTG 

CTGTTGATTCAAATAAAATTAGAGATCAAAAATTACTAGTACAGTACAAGCCTAAATGGTTAGATAAAATTGATC 

AATCITrATCAGATAAj\GATAATTATTATAATCCTGTGATTGTTCAACCATTAGTTTTA^ 

AAAGAAATGCCTAAAGATTGGACTGAATTAGGTAGTAAGTATAAAGGTAAATATTCAATTTCTGGTCTTCAAGG 

GGTACAGGACGGGCAArrCTAGCAAGTATCrrTAGTTCGATACCTTGATGATAAAGGTGAATTAGGTGTTrCCGA^ 

AAGGTTGGGAAGTAGCAAAAGAATATTTGAAAAATGCATACACTCTTCAAAAGGGAGAAAGTTCAATTC^ 

TGTTAGACAAAGAAGATCCAATACAATATGGAATGATGTGGGGTrCTGGTGCATTAGTTGGACAAAAACAACAAA 

ATGrrGTTTTCAAAGTTATGACTCCTGAGATTGGTGTACCATTTGTAACTGAACAAACTATGGTr^ 

TAAAAAACAAGCGrTAGCTAAAGAATTTATTGATTGGTTTGGTCAATCAGAAATTCAAGTAGAA^^ 

CTTTGGATCTATTCCTGCAAATAAAGATGCCCTCAAAGATCTACCTGAAGATACGAAGAAATTTGTTGA^^ 

AAACCACAAAATATTGACTGGGAAGCTGTTGGAAAGCATTTGGATGAATGGGTAGAAAAAGCTGAATTAGAATAC 

GTACAATAA 

4346.2 

ATGATTAAATTTGATAATATTCAAATTAAATATGGTGATTTTGTTGCAATTGATAATCTGAATT^ 
AAGGGGAATTTTTTACArt IViU'GGGCCTTCAGGATGTGGTAAATCAACTACTTTGAGAGCATTGGTAG G [ ■ rri CT 
AGATCCATCATCAGGAAGTATTGAAGTTAATGGAACAGATGTCACTCATTTGGAACCTGAAAAGCGTCGAATTCG 
TATTGTATTTCAATCTTATGCGCTATTTCCAACTATGACrGTTTTTGATAATATTGCA^ 

AGCTAGCTCCAGATGTTATTAAAGCTAAAGTATCAGCAGTGGCAGCAAAAATTAAGATCTCTGATCAACAGTTAC 
AGCG TAAT GTATCAGAATTATCTCGGGGTCAACAACAAAGGGTAGCATTGGCTCGTGCTCTGGTTCTTGAACCTAA 
AATTCTTTGTCTAGATGAACCATTGTCAAACCTTGACGCAAAATTACGTGTAGATTTGAGAAAAGAG^ 
CTTCAAAAAGAGTTAGGTATTACTACrrTATAT^ 

TTGCAGTCT TTAA CAATGGATACATCGA ACAGG TCGGTACACCAGTAGAGATTTATCATAATTCTCAAACTGAAT^ 
TGTATGTGATTTTATTGGAGATATTAATGTTTTGACCGATGAAACAGTCCACGAAG^ 

GTTrrCTTAGAGGATAAAAAAGGATACATTCGATTAGAGAAAGTTCGATTCAATCGTGAAACTGAACAAGATm 
TTCTAAAAGGGACAATTATrGATGTTGAGTTTTCTGGAGTTACAATTCACTATACAATAAAAGTr^ 
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GATrCTTAATGTAACAAGTATTGATACTCAGGCTGCTATTAGATCTGTCGGAGA-\AGTCTGGAATTATTTATCACA 
CCATCAGACGTTCTGCAATTTTAA 

4346.3 

5 ATGCGTCATAAATTAAATTTAAAAGATTGGCrrATTCGTTTAGGGTTAATCTGGTTCT^ 

TCCAAACTTTGATCTAGTAGTGAATGTATTTGTAAAAGGAGGAGAATTTTCCCTTGATGCT 
AAATCTCAGAGGGCACrrCAGAGTATTATGAACAGTTTTAAGTTAGCATTTTCACTCATTAT^ 
TAGGTATTCTTTGTGTTCTATTTACAGAGTACTTTGATATTAAAGGTGCT 
TCrrTAATTTATGGAGGAGTGGTTTTAGCGACTGGATATAAATT^ 

10 TTTACAAAATGTTATCCLi I CI I'l AGACCCTAACTGGTTTATTGGGTATGGTGCAGTCTTATTCArTATGACATnT 

CAGGAACTGCTAATCATACATTGTTTTTAACAAATACAATTCGAAGCGTTGACTATCACACTATTGACGCTGCTCG 
AAATATGGGAGCAAAACCATTTACTGTTTTCCGAAAAGTAGTCTTACCAACCTTAATTCCAACTCT 
ACTATT ATGG I' I ' 1 1 1 C V i" AGTGGTTTATCTGC ACT AGCAGCACCC ATG ATTGTTGGTGGTAAAG AATTTC AAACT AT 
AAATCCAATGATTATTACATTTGCAGGGATCGGGAATTCrCCTGATTTAGCTGCCCTACTTGCAATTATT^ 

1 5 ATTGCAACTACAATTTTGCTTACTATCATGAATAAGATAGAAAAAGGTGGAAATTATATTTCTATCT 

AAGCGCCTCTTAAJVAAACAAAAAATTGCGTCTAAGCCTTGGAATATCATTGCTCACATTGTAGCATATC 

CACAGTTTrCATG CTTC C ACTA Al'iUUUATAGTATTATACTCATTTACAGATCCAGTTGCAATTCAAACAGCT 

TAACATTATCAAACTTTACTTTAGAAAATTATCGCnTATTCTTTAGTAA 

AGCTTTATTTATTCTATTATTGCTGCGACAACAGCAACAATTCTCGCAGTTGTATTTGCTCGTGTTG 
20 TAAATCTCGTTTTGATTrCTTATTTGAATATGGTGCT 

GTTTATTATTTACTTTTAATCAGCCACAAi'l"! Ci IGTCTTGAATCAGATTTTGGTAGGTAGTTTGGTAATTCTACTT 

ATTGCATATATAGTTGTAAAAATCCCATTTTCTTATAGAATGGTACGTGCTATTTTATTTAGTC 

GG AAGATGCAGCAAGAA GTATG GGTGCrrCACCriU't'iATACTATGATCAAGGTTATCATTCCATTTATTTTACCG 

GTTGTTCTCrCTGTTATTGCTTTAAACTrrAACTCTTTATTAACT^ 
25 CTAGCTpAACCATTAGGTATTACGATTCGATCTGCAGGTGATGAAACAGCAACATCTAATGCACAAGCTCTGGTAT 

TTGTTTATACAATTGTTCTGATGATTATTTCTGGAACGGTATTATACTTCACACAAAGACCGGGGCGTAAAOT 

GAAATAA 
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Table 2 



MEELVTLDCLFIDRTKIEANANKYSFVWKKTTEKFSAKLQEQIQVYFQEErrPLLIKYAMFDBaCQKRGYKESAKNL^ 
HYNDKEDSYTHPDGWYYRFHHTKYQKTQTDFQQEIKVYYADEPESAPQKGLYMNERYQNLKABCECQALLSPQGRQIF 
5 AQRKIDVEPVFGQIKASLGYKRCNLRGKRQVRIDMGLVLMANNLLKYSKMKZ 

MGKGHWNRKRVYSIRKFAVGACSVMlGTCAVLLGGNlAGESVVYADETLmrrAEKPKEEKMrVEEKADKALETKNIV 

ERTEQSEPSSTEAIASEKKEDEAVTPKEEKVSAKPEEKAPRIESQASNQEKPLKEDAKAVTNEEVNQMIEDRKVDFNQN 

WYFKLNANSKEAIKPDADVSTWKKLDLPYDWSIFNDFDHESPAQNEGGQLNGGEAWYRKTFKLDEKDLKKNVRLTF 

10 DGVYMDSQVYVNGQLVGHYPNGYNQFSYDITKYLQKDGRENVIAVHAVNKQPSSRWYSGSGIYRDVTLQVTDKVHV 
EKNGTTILTPKLEEQQHGKVETHVTSKIV^^IT>DKDHELVAEYQ^V^ERGGHAVTGLVRTASRTUCAHESTSLDA^^^ 
PKLWTVLNDKPALYELrTRVYRDGQLVDAKKDLFGYRYYHWTPNEGFSLNGERIKFHGVSLHHDHGALGAEENYKAE 
YRRLKQMKEMGVNSIRTTHNPASEQTLQIAAELGLLVQEEAFDTWYGGKKPYDYGRFFEKDATHPEARKGEKWSDFD 
LRTMVERGKNNPAIFMWSIGNEIGEANGDAHSLATVKRLVKVIKDVDKTRYVTMGADKFRFGNGSGGHEECIADELDA 

15 VGFNYSEDNYKALRAKHPKWLIYGSETSSATRTRGSYYRPERELKHSNGPERNYEQSDYGNDRVGWGJCTATASWTFD 
RDNAGYAGQnWTGTDYIGEPTPWHNQNQTPVKSSYFGrVDTAGIPKHDFYLYQSQWVSVKKKPMVHLLPHWNWENK 
ELASKVADSEGKrPVRAYSNASSVELFLNGKSLGLBCTFNKKQTSDGRTYQEGANANELYLEWKVAYQPGTLEAIARDES 
GKElARDKriTAGKPAAVRLIICEDHAlAADGKDLTYIYYEIVDSQGNVVPTANNLVRFQLHGQGQLVGVDNGEQASRER 
YKAQADGSWIRKAFNGKGVAIVKSTEQAGKFTLTAHSDLL^NQVTVFTGKKEGQEKTVLGTEVPKVQTIIGEAPEMPT 

20 TVPFVYSDGSRAERPVTWSSVDVSKPGIVTVKGMAIXjREVEARVEVIAUCSELPVVKRIAPOTDLNSVDKSVSYVUDGS 

veeyevdk\^^iaeedicaklaipgsriqatgylegqp[hatlvveegnpaapavptvtvggeavtgltsqkpmqyrtla 
ygaklpevtasaknaavtvlqasaangmrasihqpkdggplqtyaiqfleeapkiahlslqvekadslkedqtvklsv 
rahyqdgtqavlpadkvtfstsgegevairkgmlelhkpgavtlnaeyegakdqveltiqantekkiaqsirpvnvvt 
dlhqepslpatvtveydkgfpkthkvtwqaipkekldsyqtfevlgkvegidlearakvsvegivsveevsvttpiaeap 

25 qlpesvrtydsnghvssakvawdairpeqyakegvftvngrlegtqlttklhvrvsaqteqga^asdq^^tgselplaf 
asdsnpsdpvsnvndklisynnqpanrwtnwnrtnpeasvgvlfgdsgilskrsvdklsvgfhedhgvgvpksyviey 
yvgktvptapknpsfvgnedhvfndsanwkpvtnlkapaqlkagemnhfsfdkvetyavrirmvkadnkrgtsrrev 
qifakqvaaakqgqtriqvdgkdi^nfnpdltdyylesvixskvpavtasvsnnglatvvpsvregepvrviakaengd 
ilgeyrlhftkdksllshkpvaavkqarllqvgqalelptkvpvyftgkdgyetkdltveweevpaenltkagqftvr 

30 grvlgsnlvaerrvrvtdklgetlsdnpnydensnqafasatndidknshdrvdylndgdhsenrrwtnwsptpssnp 
evsagvifrengkivertvtqgkvqffadsgtdapsklvleryvgpefevptyysnyqaydadhpfnnpenweavpyr 
adkdiaagdeinvtfkaikakamrwrmerkadksgvamiemtflapselpqestqskilvdgkeladfaenrqdyqit 
ykgqrpkvsveennqvastvvdsgedsfpvlvrlvsesgkqvkeyrihltkekpvsektvaavqedlpkiefvekdlay 
ktvekkdstlylgetrveqegkvgkeriftainpdgskeeklrevvevptdrivlvgticpvaqeakkpqvsekadtkpid 

35 sseasqtnkaqlpstgsaasqaavaagltllglsaglvvtkgkked2 

mkimkkkywtlailffclfnnsvtaqeipknldgnithtqtsesfsesdekqvdysnknqeevdqnkfriqidktelfvt 
tdkhleknccklelepqinndivnsesnnllgednldnkikenvshldnrggniehdkdnlessivrkyewdidkvtgg 
gesyklysicsnskvsiaxldsgvdlqntglucnlsnhsknyvpnkgylgkeegeegusdiqdrlghgtavvaqrvgddn 
40 [ngvnphvninvyrifgkssaspdwivkalfdavddgndanlstgqylmidgeyedgtndfetflkykkaldyanqkgv 
[ivaalckdslnvsnqsdllklrssrkkvrkpglvvdvpsyfsstisvggidrlcnlsdfsnkgdsdalyapacstlslsel 
glnnfinaekykedwifsatlggytylycnsfaapkvsgaiamiidkyklkdqpynymfvkkfwkkhyqz 

mkictwkvfltlvtalvavvlvacgqgtaskdnkeaelkkvdfildwtpntnhtglyvakekgyfkeagvdvdliclp 
45 peesssdlvingicapfavyfqdymaicklekgagrravaarvehntsgiisrksdnvsspkdlvgkkygtwndptelaml 
ktlvesqggdfekvekvpnndsnsitp[angvfdtawryygwogilaksqgvdanfmylkdyvkefdyyspviiannd 
ylkdnkeearkviqaikkgyqyamehpeeaadiliknapelkekrdfvtesqicylskeyasdkekwgqfdaarwnafy 
kwdkengilkedltdkgftnefvk2 

50 mkrtwrnsfvtnlm-pfmigkieipnrtvlapmagvtnsafrtiakelgaglvvmemvsdkgiqynnektlhmlhide 
genpvsiqlfgsd£dslaraaefiqe^r^ctdivdinmgcpvnkivkneagamwlkdfd^c^yslinkvqsvldlpltvkmr 
tgwadpslavenalaaeaagvsalamhgrtreqmytghadletlykvaqaltkipfiangdirtvqeakqrieevga 
davmigraamgnpylfnqinhyfetgeilpdltfedkmkiayehlkrlinlkgenvavrefrglaphylrgtsgaakl 
rgaisqastlaeietllqleka2 

55 

MIKNPKLLTKSFLRSFAILGGVGLVIHlAIYLTFPFYYIQLEGEKFNESARVFTEYLfCTKTSDEIPSLLQSYSKSLTISAHLK 
RDIVDKRLPLVHDLDIKDGKLSNYIVMLDMSVSTADGKQVTVQFVHGVDVYKEAKNILtLYLPYTFLVTlAFSFVFSYF 
YTKRLLNPLFYISEVTSKMQDLDDNIRFDESRKDEVGEVGKQtNGMYEHLLKVIYELESRNEQrVKLQNQKVSFVRGAS 
H£LK:TPLASLRJILENMQHNIGDYKDHPKYIAKSINK!DQMSHLL£EVLESSKFQEWTECR£TTVKPVLVD[LSRYQELAH 
60 SIGVTIENQLTDATRVVMSLRALDKVLTNUSNAIKYSDKNGRVIISEQDGYLSIKNTCAPLSDQELEHLFDIFYHSQIVTD 
KDESSGLGLYTVNNILESYQMDYSFLPYEHGMEFKISLZ 

MYLGDLMEKAECGQFSIL^FLLQESQTTVKAVMEETCFSKATLTKYVTLLNDICALDSGLELAlHSEDENLRLSiGAATK 

grdirslflesavkyqilvyllyhqqflahqlaqelviseatlgrhlaglnqilsefdlsiqngrwrgpehqihyfyfcl 
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FRKVWSSQEWEGHMQKPERKQEIANLEEICGASLSAGQKLDLVLWAHISQQRLRVNACQFQVIEEKMRGYFDNIFYLR 
LLRKVPSFFAGQHIPLGVEDGEMMIFFSn.LSHRrLPLHTMEyiLGFGGQLADLLTQUQEMKKEELLGDYTEDHVTyEL 
SQLCAQVYLYKGYILQDRYKYQLENRHPVLLMEHDFKETAEEIFHALPAFQQGTDLDKKILWEWLQLrEYMAENGGQ 
HMRIGLDLTSGFLVFSRMAAILKRYLEYNRFITIEAYDPSRHYDLLVTNNPIHKKEQTPVYYLKNDLDMEDLVAIRQLLF 

5 TZ 

MEFSKlO'RELSIKXMQERTLDLUIGCGrrGAGVALQAAASGLETGLIEMQDFAEGTSSRSTKLVHGGLRYLK^^ 
VSDTVSERAVVQQlAPHIPKSDPMLLPVYDEDGATFSLFRLKVAMDLYDLUSLGVS^^TAANKVLSKDQVLERQP^a.KK 
EGLVGGGVYLDFRNNDARLVIENIKRANQDGALlANHViCAEGFLFDESGKrrGVVARDLLTDQVFEIKARLVINTTGPW 
10 SDKVRNl^NKGTQFSQMRPTKGVHLVVDSSKIKVSQPVYFDTGLGIX;RMVFVLPRE^^aYFGT^)TO 

QEDVDYLLGIVNNRFPESNITIDDIESSWAGLRPLIAGNSASDYNGGNNGTISDESFDNLIATVESYLSKEKTREDVESAV 
SKLESSTSEKHLDPSAVSRGSSLDRDDNGLLTLAGGKrrDYRKMAEGAMERVVDU^EFDRSFKUNSKTYPVSGGELN 
PAWDSEIEAFAQLGVSRGLDSKEAHYLANLYGSNAPKVFALAHSLEQAPGLSLADTLSLHYAMRNELTLSPVDFLLRR 
TNHMLFMRDSLDSIVEPILDEMGRFYDWTEEEKATYRADVEAALANNDLAELKN2 

15 

MMNELFGEFLGTLIULLGNGVVAGVVLPKTKSNSSGWTVITMGWGIAVAVAVFVSGKLSPAYLNPAVTICVALKGGLP 

WASVLPYILAQFAGAMLGQILVWLQFKPHYEAEENAGNILATFSTGPAIKDTVS^fLrSEILGTFVLVLTIFALGLYDFQA 

GIGTFAVGTLIVGIGLSLGGTTGYALNPARDLGPRIMHSILPIPNKGDGDWSYAWIPVVGPVIGAALAVLVFSLFZ 

20 MTKKKmRISVmimKILWLKWYFMRDKEQPKYSVtERKMFDAAKNQbMLAYQKYATIKQrro 
VYVYNHMNVIGACQRILFISQSPAYDKLNKWFNIYSDLYFSVVPLPKMGVYHEMVGE 

MKNSNEAEMKLLYTDIRTSLTEILTREAEELVAAGKRVFYIAPNSLSFEKERAVLEYl^QQASFSITVTRFAQMARYLVL 
NDLPAKTTUDDIGLGLAFYKCLAELDPKDLRVYGAIKQDPQLIQQLIELYHEMTKSQMSFLDLENLTDEDKRADLLLIF 

25 EKVTAYLNQGQLAQESQLSHLIEAIENDKVSSDFNQlALVIDGFTRFSAEEERVVDLLHGKGVEtVlGAYASKKAYTSPFS 
EGNLYQASVKFLHHLASKYQTPAQDCSQTHEKMDSFDKASRLLESSYDFSELALDVDEKDRENLQIWSCLTQKEELEL 
VARSIRQKLHENSDLSYKHFRILLGDVASYQLSLKTIFDQYQIPFYLGRSEAMAHHPLTQFVESILALKRYRFRQEDUNL 
LRTDLYTDLSQSDIDAFEQYlRYLGINGLPAFQQTFTKSHHGKFNLERLNVLRLRiLAPLETLFASRKQfCAEKLLQKWSV 
FLKEGAVTKQLQDLTTTLEAVEQERQAEVWKAFCHVLEQFATVFAGSQVSLEDFLALLHSGMSLSQYRTIPATVDTVL 

30 VQSYDLIAPLTADFVYAlGLTQDNLPKISQNTSLLTDEERQNLNQATEEGVQLLIASSENLKKNRYTMLSLVISfSARKQLF 
LSAPSLFNESESKESAYLQEUHFGFRRREKRMNHKGLSKEDMGSYHSLLSSLVAYHQQGEMSDTEQDLTFVKVLSRVI 
GKKLDQQGLENPAIPTSPSSKTLAKDTLQALYPAKQEFYLSTSGLTEFYRNEYSYFLRYVLGLQEELRLHPDARSHGNFL 
HRIFERALQLPNEDSFDQRLEQAIQETSQEREFEAIYQESLEAQFTKEVLLDVARTTGHILRHNPAIETIKEEANFGGKDQ 
AFIQLDNGRSVFVRGKVDRIDRLICANGAIGVVDYKSSLTQFQFPHFFNGLNSQLPTYLAALKREGEQNFFGAMYLEMA 

35 EPVQSLMAVKSLAGAVVEASKSMKYQGLFLEKESSYLGEFYNKNKANQLTDEEFQLLLDYNAYLYKKAAEKILAGRF 
AlNPYTENGRSlAPYVQQHQAirGFEANYHLGQARFLEKLDLADGKRLVGEKLKQAWLEKIREELNRZ 



MKLIPFLSEEEIQKLQEAEANSSKEQECKTAEQIEAIYTSAQNILVSASAGSGKTFVMAERILDQLARGVEISQLFISTFTVK 
40 AATELKERLEKKISKKIQETDDVDLKQHLGRQLADLPNAAIGTMDSFTQKFLGKHGYLLDIAPNFRILQNQSEQLILENE 
VFHEVFEAHYQGKQKETFSHLLFCNFAGRGKDERGLRQQVYKIYDFLQSTSNPQKWLSESFLKGFEKADFTSEKEKLTE 
QIKQALWDLESFFRYHLDNDAKEFAKAAYLENVQLILDEIGSLNQESDSQAYQAVLARVVAISKEKNGRALTNASRKA 
DLKPLADAYNEERKTQFAKLGQLSDQIAILDYQERYHGDTWKLAKTFQSFMSDFVEAYRQRKRQENAFEFADISHYTIE 
ILENFPQVRESYQERFHEVMVDEYQDTNHlQERMLELLSNGHNRFMVGDIKQSIYRFRQADPQtFNEKFQRYAQNPQEG 
45 RLULKENFRSSSEVLSATNDVFERLMDQEVGEINYDNKHQLVFANTKLTPNPDNKAAFLLYDKDDTGEEEESQTETKL 
TGEMRLVIKEILKLHQEKGVAFKEIALLTSSRSRNDQILLALSEYGIPVKTDGEQNNYLQSLEVQVMLDTLRVIHNPLQD 
YALVALMKSPMFGFDEDEUVRLSLQKAEDKVHENLYEKLVNAQmASSQKGLlKTALAEKLKQFMDILASWRLYAKT 
HSLYDUWKIYNDRFYYDYVGALPNGPARQANLYALALRADQFEKSNFKGLSRFIRMIDQVLEAQHDLASVAVAPPKD 
AVELMTIHKSKGLEFPYVFILNMDQDFNKQDSMSEVILSRQNGLCVKYIAKMETGAVEDHYPKTIKLSIPSLTYRQNEEE 
50 LQLASYSEQMRLLYVAMTRAEKKLYLVGKGSREKI-ESKEYPAAKNGKLNSOTRLQARNFQDWLWAISKVFrKlDKLNF 
SYRFIGEE>QLTR£AIGELETKSPLQDSSQADNRQSDTIKEALEMLKEVEVYNTLHRAAIELPSVQTPSQIKKFYEPVMDM 
EGVEIAGQGQSVGKKISFDLPDFSTKEKVTGAEIGSATHELMQRIDLSQQLTLASLTETLKQVQTSQAVRDKINLDKILAF 
FDTVLGQEILANTDHLYREQPFSMLKRDQKSQEDFVVRGILDGYLLYENKIVLFDYKTDRYDEPSQLVDRYRGQLALY 
EEAL5RAYSIENIEKYULLGKDEVQVViCVZ 

MELARHAESLGVDAIATIPPIYFRLPEYSVAKYWNDISSAAPNTDYV[YNIPQLACVALTPSLYT£MLKNPRVIGVKNSS 
MPVQDIQTFVSLGGEDHIVFNGPDEQFLGGRLMGARAGIGGTYGAMPELFLKLNQLlADKDLETARELQYAlNAIIGiCt 
TSAHGNMYGVIKEVLKINEGLNIGSVRSPLTPVTEEDRPVVEAAAALIR£TKERFL2 

60 

MYKTKCLREKLVLFLKIFFPILIYQFANYSASFVDTAMTGQYNTMDLAGVSMATSIWNPFFTFLTGIVSALVPnCHHLG 
RGKKEEVASDFYQFIYLALGLSVVLLGMVLFLAPIILNHIGLEAAVAAVAVRYLWFLSIGIIPLLLFSVIRSLLDSLGLTKL 
SMYLMLLLLPLNSGFNYLLIYGAFGVPELGGAGAGLGTSLAYWVLLGISVLVLFKQEKLKALHLEKRIPLNMDKIKEGV 
RLGLPIGGTVFAEVAIFSVVGLIMAKFSPLIIASHQSAMNFSSLMYAFPMSISSAMAIVVSYEVGAKRFDDAKTYIGLGRW 
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TALIFAAFIXTFLYIFRGNVASLYGNDPKFlDLTVRiT-TYSLFFQLADTFAAPLQGILRGYKDTVIPFYLGLLGYWGVAIP 
VYAIZ 

MSTLAKIEAIXFVAGEDGIRVRQU^LI^LPPTGIQQSLClCI^QKYEKDPDSSLALIETSGAYRl-\^ 
5 PINQSt^RAALETI^IlAYKQPrrRIEmAmGVNSSGAlAKLQAFDLrKEDGKKEVLGRPNLYVTTDYFL^ 
PVIDELEIQAQESQLFGERIEEDENQZ 

MDTMISRFFRHLFEALKSUCRNGWMTVAAVSSVMrrLTLVAIFASVIFNTTAKLATDIENNVRVVVYIRKDVEDNSQra 
KEGQTVTNNDYHKVYDSLKNMSTVKSVTFSSKEEQYEKLTEIMGDNWKIFEGDANPLYDAYIVEANTPNDVICTIAEDA 
10 KKIEGVSEVQDGGAm-ERLFKLASnRVWGLGIAALLIFlAVFUSNTrRTniSRSREIQIMRLVGAKNSYIRGPFLLEGAFIG 
LLGAlAPSVLVFIVYQIVYQSVNKSLVGQNLSMISPDLFSPLMlALLFVIGVFrGSLGSGISMRRFLKIZ 

MKKVRFIFLALLFFLASPEGAMASDGTWQGKQYLKEDGSQAANEWFDTHYQSWFYIKADANYAENEWUKQGDDYF 
YLKSGGYMAKSEWVEDKGAFYYLDQDGKMKRNAWVGTSYVGATGAKVIEDWVYDSQYDAWFYIKADGQHAEKEW 

15 LQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYQSWFYTKENGNYADKEWIFENGHYYYUCSGGY 
MAANEWtWDKESWFYUCFIXJKMAEKEWYDSHSQAWYYFKSGGYMTANEWIWDKESWFYLKSDGKL^EKEWVY 
SHSQAWYYFKSGGYMTANEWIWDKESWFYUCSIX3iClA£KEWVYDSHSQAWYYFKSGGYMAKNETVDGYQLGSDGK 
WLGGKTTNENAAYYQVVPVTANVYDSDGEKLSYISQGSVVWLDKDRKSDDKRLAmSGLSGYMKTEDLQALDASKD 
FIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAEELDKVFSLLNI 

20 hWSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKL\KDKKNFFGrrAYDTTPYl5AKTFDDVDKGIL^ 
KENYIDRGRTFLG^fKASGM^rVEYASDPYWGEKIASVMMKINEKLGOKD2 

MKKVLQKYWAWAFV^rPLLLQAIFFYVPMFQGAFYSFTNWTGLTY^^V^CFVGLNNFKLLFMDPKFMNAIGF^A 
WGEIALGIFIARVLNSKIKGQTFFRAWFFFPAVLSGLTVALIFKQVFNYGLPAIGNALHIEFFQTSLLGTKWGAIFAAVF 
25 VLLWQGVAMPIIIFLAGLQSIPTEITEAARIDGATSICQVFWhnELPYLLPSVSMVFILALKGGLTAFDQVFAMTCGGPNN 
ATTSLGULVYNYAFKNNQFGYANAIAVILFFLIVVISIIQLRVSKKFEIZ 

MMKQDERKALIGKYILLILGSVLILVPLLATU=SSFKPTKDIVDNFFGFPTNFTWDNFSRLLADGIGGYYWNSVVrrVLSL 
LAVMinPMAAYSlARNMSKRKAFTIMYTLLILGIFVpFQVlMIPrrVMMSKLGLAOTFGLILLYLTYAlPQTLFLYVGYIKI 
30 SIPESLDEAAErDGANQFTTYFRIlFPMMKPMHATTMIINALWFWNDFMLPLLVLNRDSKMWTLPLFQYNYAGQYFND 
YGPSFASYVVGIISmVYLFFQRHIISGMSNGAVKZ 

MKSILQKMGEHPMLLLFLSYSTVISILAQNWMGLVASVGMFLFTIFFLHYQSILSHKFFRLILQFVLFGSVLSAAFASLEH 
FQIVKKFNYAFI^PNMQVWHQNRAEVTFFNPNYYGIICCFCIMIAFYLFTTTKLNWLKVFCVIAGFVNLFGLNFTQNRT 
35 AFPAIUGAIIYLFTTIKNWKAFWLSIGVFAIGLSFLFSSDLGVRMGTLDSSMEERISIWDAGMALFKQNPFWGEGPLTYM 
NSYPRIHAPYHEHAHSLYIDTILSYGIVGTILLVLSSVAPVRLMMDMSQESGKRPIIGLYL^FLTVVAVHGIFDLALFWIQS 
GFIFLLVMCSIPLEHRMLVSDNfTDZ 

MSKMDVQKIIAPMMKFVNMRGIUUCDGMLAILPLTVVGSLFUMGQLPFECLNKSIASVFGANWTEPFMQVYSGTFAI 
40 MGLISCFSIAYSYAKNSGVEALPAGVLSVSAFFILLRSSYIPKQGEAIGDAISICVWFGGQGIIGAIIIGLVVGSIYTFFIKRKIV 
IKMPEQVPQAIAKQFEAMIPAFVIFI^SMIVYII^KSLTNGGTFIEMIYSAIQVPLQGLTGSLYGAIGIAFFISFLWWFGVH 
GQSVVNG\^ALLLSm,DANKAMLASANLSLENGAHIVTQQFLDSFLrLSGSCITFGLVVAMLFAAKSKQYQALGKVA 
AFPAIFNVN£PVVFGFPrVMNPVMFVPP!LVPVU\AVIVYGAlATGFMQPFSGVTLPWSTPAlLSGFLVGGWQGVrrQLVl 
LAMSTLVYFPFFKVQDRLAYQNEIKQS2 

MKKKDLVDQLVSEIETGKVRTLGIYGHGASGFCSTFAQELYQALDSTTVNLLETDPYITSGRHLVVPKDAPNQKVTASLP 
VAHELESLQRDILACRRVWMSZ 

MKKRYLVLTALLALSLAACSQEKTKNEDGETKTEQTAKADGTVGSKSQGAAQKKAEVVNKGDYYSIQGKYDEIIVAN 
50 KHYPL5KDYNPGENPTAKAELVKLIKAMQEAGFPISDHYSGFRSYETQTKLYQDYVNQDGKAAADRYSARPGYSEHQT 
GLAFDVIGTDGDLVTEEKAAQWLLDHAADYGFVVRYLKGKEKETGYMAEEWHLRYVGKEAKEIAASGLSLEEYYGF 
EGGDYVD2 

MREPDFLNHFLKKGYFKKHAKAVLALSGGLDSMFLFKVLSTYQKELEIELJLAHVNHKQRIESDWEEKELRKLAAEAE 
55 LPIYISNFSGEFSEARARNFRYDFFQEVMKKTGATALVTAHHADDQVETIFMRLIRGTRLRYLSGIKEKQVVGEIEIIRPFL 
HFQICKDFPSiFHFEDTSNQENHYFRNRIRNSYLPELEKENPRFRDAILGIGNEILDYDLAIAELSNNlNVEDLQQLFSYSES 
TQRVLLQTYLNRFPDLNLTKAQFAEVQQILKSKSQYRHPlKNGYELtKEYQQFQiCfCISPQADEKEDELVLHYQNQVAY 
QGYLFSFGLPLEGELIQQIPVSRETSIHIRHRKTGDVLIKNGHRKKLRRi-FIDLKIPMEKRNSAUIEQFGElVSILGIATNNL 
SKKTKNDiMtsrrVLYIEKIDRZ 

60 

MRKFUILLLPSFLTISKVVSTEKEVVYTSKEIYYLSQSDFGIYFREKLSSPMVYGEVPVYANEDLVVESGKLTPKTSFQIT 
EWRLNKQGIPVFKLSNHQFlAADKRFLYDQSEVTPTrKKVWLESDFKtYNSPYDLKEVKSSLSAYSQVSIDKTMFVEGRE 
FLHlDQAGWVAKESTSEEDNRMSKVQEMLSEKYQKDSFSIYVKQLTTGKEAGINQDEICMYAASVLiaSYLYYTQEKIN 
EGLYQLDTTVKYVSAVNDFPGSYKPEGSGSLPfCKEDNKEYSLKDLITKVSKESDNVAHNLLGYYlSNQSDATFKSKMSA 
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IMGDDWDPKEmSSKMAGKFMEAIYNQNGFVLESLTKTDFDSQRiAKGVSVKVAHKIGDADEFKHDTGVVYADSPFIL 
SIFTKNSDYDTISKIAKDVYEVLKZ 

MKXQNNGLIKNPFLWLLFIFFLVTGFQYFYSGNNSGGSQQINYTELVQEITDGNVKELTYQPNGSVIEVSGVYKNPKTSK 
5 EETGlQFFTPSVTKVEKFTSTlLPADTTVSELQKLATDHKAEVTVBCHESSSGrWINLLVSrvPFGILFFFLFSMMGNMGGG 
NGRNPMSFGRSKAKAAKKEDIKVRFSDVAGAEEEKQELVEVVEFLKDPKRFTKLGARIPAGVLLEGPPGTGKTLLAK^ 
VAGEAGVPFFSISGSDFVEMFVGVGASRVRSLFEDAKKAAPAIIFIDEIDAVGRQRGVGLGGGNDEREQTLNQLLIEMDG 
FEGNEGIIVlAAThTRSDVLDPALLRPGRFDRKVLVGRPDVKGREAILKVHAKNKPLAEDVDLKLVAQQTPGFVGADLEN 
VLNEAALVAARRNKSnDASDIDEAEDRVIAGPSKKDFTTVSQKERELVAYHEAGHTIVGLVLSNARVVHKVTIVPRGRA 
10 GGYMIALPKEDOMLLSKEDMKEQU^GLMGGRVAEEIIFNVQTTGASNDFEQATQMARAMVTEYGMSEKLGPVQYEG 
NHAMLGAQSPQKSISEQTAYEIDEEVRSLLNEARNKAAEIIQSNRETHKLIAEALLKYETLDSTQIKALYETGKMPBAVE 
EESHALSYDEVKSiCMNDEKZ 

MKRSSLLVRMVISIFLVFULLALVGTFYYQSSSSAlEATIEGr^SQTTISQTSHFIQSYlKKLETTSTGLTQQTDVLAYAENP 

15 sqdkvegirdlfltilksdkduo'vvlvtksgqvistddsvqmktssdmmaedwyqkaihqgampvltparksdsqw 
visvtqelvdakganlgvlrldisyetleaylnqlqlgqqgfafiinenhefvyhpqhtvysssskmeamkpyidtgqg 
ytpghksyvsqekiagtdwtvlgvsslekldqvrsqllwtllgasvtsllvclclvwfslkrwiapucdlretmleias 
gaqnlrakevgayelr£vtrqfnamldq[dqlmvalrsoeettrqyqlqalssqinphflyntldtinvmaefhdsqr 
vvqvtkslatyfrlalnqgkdliclsdeinhvrqylfiqkqrygdkleyeinenvafdnlvlpklvlqplvenalyhgi 

20 ^ceke<kk5h[klsvqkqdsglviri£dogvgfqdagossqsqlxrggvglqnvdqrlklhk3ahyhmkidsrpqkg^ 
eiyinrietsz 

mkrssllvrmvisiflvflillalvgtfyyqssssaieatiegnsqttisqtshfiqsyikklettstgltqqtdvlayaenp 
sqdkvegirdlfltilksdk:dlktvvlvtksgqvistddsvqmktssdmmaedwyqka[hqgampvltparksdsqw 

25 visvtqelvdakganlgvlrldisyetleaylnqlqlgqqgfafiinenhefvyhpqhtvysssskmeamkpyidtgqg 
ytpghkstvsqekiagtdwtvlgvssi^kldqvrsqllwtllgasvtsllvclclvwfslkrwuplkdijietmle 
gaqnlrakevgayelrevtrqfnamldqidolmvairsqeettrqyqlqalssqinphflyntldtiiwmaefhdsqr 
vvqvtkslatyfrlalnqgkdliclsdeinhvrqylfiqkqrygdkleyeinenvafdnlvlpklvlqplvenalyhgi 
kekegqghiklsvqkqdsglvirieddgvgfqdagdssqsqlkrggvglqnvdqrlklhfganyhmkidsrpqkgtkv 

30 eiyinrietsz 

MFFKLLR£ALKVKQVRSKILFriFrV^LVFRJGTSrrVPGVNANStKALSGLSFLNMLSLVSGNAl.KNFSIFALGVSPYrrASI 
VVQLLQMDILPKFVEWGKQGEVGRiUCLNQATRYULVl^FVQSIGrrAGFNTLAGAQLIKTALTPQVFLTIGIILTAGSMI 
VTWLGEQrrDKGYGNGVSMnFAGIVSSIPEMIQGIYVDYFVNVPSSRrrSStlFVIILirrVLUIYFrrYVQQAEYKTPIQYTK 
35 VAQGAPSSSYLPUCVNPAGVIPVIFASSrTAAPAAlLQFl^ATGHDWAWVRVAQEMLATTSPTGlAMYALLULFTFFYTF 
VQINPEKAAETYKRVVPISMEFVLVKVQKNICLNFFVVLQLLVPSSLVZ 

MDIRQVT£TIAMIEEQNFDIRTrrMGISLLDCIDPDINRAAEKIYQKriTK.\ANLVAVGDElAAELGIPIVNKRVSVTPISLIG 
AATDATDYVVLAKALDKAAKEIGVDFIGGFSALVQKGYQKGDEILINSrPRALAETDKVCSSVNIGSTKSGINMTAVAD 
40 MGRIIKETANLSDMGVAKLVVFANAVEDNPFMAGAFHGVGEADVIINVGVSGPGVVKRALEKVRGQSFDVVAETVKK 
TAFKITRIGQLVGQMASERLGVEFGrVDLSLAPTPAVGDSVARVLEEMGLETVGTHGTTAALALLNDQVKKGGVMAC 
NQVGGLSGAFIPVSEDEGMIAAVQNGSLNLEKLEAN^AlCSVGLDMlArPEDTPAETIAAMIADEAAIGVINMFTTTAVRII 
PKGKEGDMIEFGGLLGTAPVMKVNGASSVDFISRGGQIPAPIHSFKNZ 

45 MTQIIDGKALAAKLQGQLAEKTAKLKEETGLVPGLVVILVGDNPASQVYVRNKERSALAAGFRSEVVRVPETITQEELL 
DLIAKYNQDPAWHGILVQLPLPKHIDEEAVLLAIDPEKDVDGFHPLNMGRLWSGHPVMIPSTPAGIMEMFHEYGIDLEG 
KNAVVIGRSNIVGKPMAQLLLAKNATVTLTHSRTHNLSKVAAXADILVVAIGRAKFVTADFVKPGAVVIDVGMNRDEN 
GBCLCGDVDYEAVAPLASHITPVPGGVGPMTITMLMEQTYQAALRTLDRKZ 

50 MSKFNRIHLVVLDSVGIGAAPDANNFVNAGVPDGASDTLGHISKTVGLNVPNMAKIGLGNIPRETPLKTVAAESNPTGY 
ATKLEEVSLGKDTMTGHWEIMGLNITEPFDTFWNGFPEEILTKIEEFSGRKVIREANfCPYSGTAVIYDFGPRQMETGELlI 
YTSADPVLQIAAHEDIIPLDELYRICEYARSrrLERPALLGRIIARPYVGEPGNFTRTANRRDLAVSPFFPTVLDKLNEAGI 
DTYAVGKINDIFNGAGINHDMGHNKSNSHGIDTLLKTMGLAEFEKGFSFTNLVDFDALYGHRRNAHGYRDCLHEFDE 
RLPEIIAAMRENDLLLITADHGNDPTYAGTDHTRHYIPLLAYSPAFKGNGUPVGHFADISATVADNFGVETAMIGESFL 

55 DKLVZ 

MFlSISAGIVTFLLTLVEIPAFIQFYRKAQrrCQQMHEDVKQHQAlCAGTPTMGGLVFLITSVLVAFFFALFSSQFSNNVGM 
ILFILVLYGLVGFLDDFLKVFRKINEGLNPKQKLALQLLGGVIFYLFYERGGDILSVFGYPVHLGFFYIFFALFWLVGFSN 
AVNLTDGVDGl^SrSVVISl^AYGVlAYVQGQMDlLLVILAMIGGLLGFFIFNHKPAiCVFMGDVGSLALGGMLAAlSMA 
60 LHQEWTLLIIGrVYVFETTSVMMQVSYFKLTGGKRJFRMTPVHHHFELGGLSGKGNPWSEWKVDFFFWGVGLLASLLT 
LAILYLM2 

LFKKNKDILNlALPAMGENFLQMLMGMVDSYLVAHLGLIAISGVSVAGlsrirnYQAlFlALGAAlSSVISKSIGQKDQSKLA 
YHVTEALKITLLLSFLLGFLSIFAGKEMIGLLCTERDVAESGGLYLSLVGGSIVLLGLMTSLGAURATHKPRLPLYVSFL 
65 SNALNILFSSLAlFVLDMGtAGVAWGTIVSRLVGLVlLWSQLKLPYGKPTFGLDICELLTLALPAAGERI.MMRAGDVVIlA 
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LVVSFOTEAVAGNAlGEVLTQFNYMPAFGVATATVMLLARAVGEDDWKRVASLSKQTFWI^LFLMLPLSFSrYVLGVP 

LTHLYTTDSLAVEASVLVTLFSLLGTPMTTGTVIYTAVWQGLGNARLPFYATSIGMWCIRIGTGYLM 

AGSLLDNGFRWLFLRYRYQRYMSLKG2 

5 MQTQEKHSQAAVLGLQHLLAMYSGSILVPIMIATALGYSAEQLTYLISTDIFMCGVATFLQLQLNKYFGIGLPVVLGVA 
FQSVAPLIMiGQSHGSGAMFGALlASGIYVVLVSGIFSKVANLFPSIVTGSVnrTIGLTLIPVAlGNMGNNVPEPTCQSLLLA 
ArrVUILLIKIFTKGFIKSISlLIGLWGTAIAATMGLVDFSPVAVAPLVHVFTPLYFGMPTFEISSrVMMCIlATVSMVEST 
GVYLAl^DrrKDPIDSTRLRNGYRAEGLAVLLGGIFOTFPYTGFSQNTVGLVKLSGIKKRtPIYYAAGFLVLLGLLPKFGA 
L^QIIPSSVLGGAMLVMFGFVSIQGMQILARVDFANNEHNFLIAAVSIAAGVGLNNSNLFVSMPTAFQMFFSNCIVVASL 

10 LAIVLNAVLNHKKK2 



MKDRIKEYLQDKGKVTVNDLAQALGKDSSKDFRELIKTI^MERKHQIRFEEDGSLTLEIKKKHErrUCGIFHAHKNGFG 
FVSLEGEEDDLFVGKNDVNYAIDGDTVEVVIKKVADRNFCCTAAEAICIIDILEHSLTTVVGQEVLDQEKPKYAGYIRSKN 

15 QiaSQPlYVKKPALKLEGTEVLKVFlDKYPSKKHDFFVASVLDVVGHSTDVGIDVLEVLESMDrVSEFPEAWKEAESVP 
DAPSQKDMEGRLDLRDEirFTIDGADAKDLDDAVHIICALKNGNLEFGVHlADVSYYVTEGSALDKEALNRATSVYVTD 
RVVPMLPERLSNGICSLNPQVDRLTQSAIMEIDBCHGRVVNYTrrQTVIKTSFRMTYSDVNDILAGDEEKRKEYHKIVSSIE 
LMAKLHETLENMRVKRGALNFDTNEAKILVDKQGKPVDIVLRQRGIAERMIESFMLMANETVAEHFSKLDLPHYRIHE 
£PKA£KVQKnDYASSFGLRJYGTASEISQ£ALQDlMRAV£G£PYADVI^MMLLRSMQQARY5£I^ 

20 HFT^PIRRYPDLLVHRMIRDYGRSKEIAEHFEQVIPEIATQSSNRERRAIEAEREVEAMKKAEYMEEYVGEEYDAVVSSIV 
KFGLFV£LP^rrVEGLIHrTNLPEFYHFNERDLTLRGEKSGITFRVGQQ^RIRV£RADiCMTGEIDFSFVPSEFDVIEKGLKQS 
SRSGRGRDSNRRSDKKEDKRKSGRSNDKRKHSQKDKKKKGKKPFYKEVAKKGAKHGKGRGKGRRTKZ 

MGTTGFniDLIILIVYLLAVLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMLSPISFLGLAGSSYAGSWlLWFA 
25 QLGMVVAEPLTIRHLPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMYLPSAGLSVLTGIDINILIILMGVV 
AIVYSYTGGLKSVLWTDRQGVILISGVVLALFVLIANIFCGGFGAVAETLANGKFLAANEBCLFDPNLLSNSIFLIVMGSGF 
TIl^SYASSQDLVQRFTTTQNlKKLNmLFTNGVL^lJ^TATVFYLIGTGLYVFYQVQNADSAASNIPQDQIFMYFIAYQL 
PVGrrGLILAAlYAASQSTISTGLNSVATSWTLDIQDVISKNMSDNRRTKIAQFVSl^VGLFSIGVSrVMAHSDIKSAYEWF 
NSFMGLVLGLLGGVHLGFVSKKANKQGAYAALrVSTIVMVFIKYFLPPTAVSYWAYSLISISVSVVSGYIVSVLTGNKVS 
30 APKYTTIHDITEIKADSSWEVRHZ 

MKFSKKYIAAGSAVIVSLSLCAYALNQHRSQENKDNNRVSYVDGSQSSQiCSENLTPDQVSQKEGIQAEQrVKITDQGYV 
TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKDAAHADMVRTKDEINRQK 
QEHVKDNEKVNSNVAVARSCXSRYTTNDGYVFNPADUEDTGNAYIVPHGGHYHYIPiCSDLSASELAAAiCAHLAGKNM 
35 QPSQLSYSSTASDIsrhTTQSVAKGSTSKPANKSENLQSLLfCELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHF 
IPYSKLSALEEKIARMVPlSGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYrFNPKDrVEETATAYIVRHGD 
HFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIK 
VRKNIZ 

40 MKKRAIVAVIVLLLIGLDQLVKSYIVQQIPLGEVRSWIPNFVSLTYLQNRGAAFSILQDQQLLFAVrrLWVIGAfWYLHK 
HMEDSFWMVLGLTLIIAGGLGNFIDRVSQGFVVDMFHLD^NFAIFNVADSYLTVGVrILLIAMLKEEING^fZ 

M^n^NLASFIVGLIIDENDRFYFVQ^CDGQTYALAKEEGQHTVGDTVKGFAYTDMKQKLRLT^LEVTATQDQFGWGRV^ 
EVRKDLGVFVDTGLPDKEIVVSLDILPELKELWPKKGDQLYIRLEVDKKDRIWGLLAYQEDFQRLARPAYNNMQNQN 
45 WPAIVYRLKLSGTFVYLPENNMLGFIHPSERYAEPRLGQVLDARVIGFRHVDRTLNLSLKPRSFEMLENDAQMILTYLE 
SNGGFMTLNDKSSPDDIKATFGISKGQFKKALGGLMICACKIKQDQFGTELIZ 

MKDVSLFLUCICVFKSRLNWIVLALFV'SVLGVTFYLNSQTANSHSLESRLESRIAANERAINENEEKI^QMSDTSSEEYQF 
AKNNLDVQKNLLTRKTEILTLLKEGRWKEAYYLQWQDEEKNYEFVSNDPTASPGLKMGVDRERKIYQALYPLNIKAH 
50 TLEFPTHGIDQIVWILEVIIPSLFVVAIIFMLTQLFAERYQNHLDTAHLYPVSKVTFAISSLGVGVGYVTVLFIGICGFSFLV 
GSLISGFGQLDYPYPIYSLVNQEVTIGKIQDVLFPGLLLAFLAnvrVEVVYLlAYFFKQKMPVLFLSLIGIVGLLFGIQTIQP 
LQRiAHLIPFTYLRSVEILSGRLPKQIDNVDLNWSMGMVLLPCLIIFLLLGILFIERWGSSQKKEFFNRFZ 

MWICnLDIVSTPAIL.VALIAILGtVLQKK:!CLPDIIKGGIKTFVGFLVVSGGAGIVQNSLNPFGTMFEHAFHLSGVVPNNEAI 
55 VAVALTTYGSATAMIMFAGMVFNILIARFTRFKYIFLTGHHTLYMACMIAVILSVAGFTSLPLILLGGLALGIIMSISPAF 
VQKYMVQLTGNDKVALGHFSSLGYWLSGFTGSHGDKSKSTEDIKFPKSLAFLRDSTVSITLSMAVIYnVAlFAGSEYIEK 
EISSGTSGLVYAl-QLAGQFAAGVFVrLAGVRLILGEIVPAFKGISERLVPNSKPALDCPIVYTYAPNAVUGFlSSFVGGLVS 
MVIMIASGTVVILPGVVPHFFCGATAGVIGNASGGVRGATIGAFLQGILISFLPVFLMPVLGGLGFQGSTFSDADFGLSGn 
LGMLNQFGSQAGIVIGLVLILAVMFGVSFIKiCPSATEEZ 

60 

MIKTFLSAL^VlLFSIPIITYSFFPSSNLNWl^QPIl^QIYAFPLATATMAAILSFLFFFLSFYXKNKQIRFYSGILl^ 
LLFGTDKTLSSASNKTBCTLKLVTWNVANQIEAQHIERIFSHFDADMAIFPEI^TNIRGEQENQRIKLLFHQVGL^MANYD 
IFTSPPTNSGtAPVTVrVKKSYGFYTEAXTFHTTRFGTlVLHSRKQNIPDIIALHTAPPLPGLMEIWKQDLNnHNQLA^ 
KAIIAGDFNATMRHGALAKISSHRDALNALPPFERGTWNSQSPK.LFNATIDHILLPICNHYYVKDLDrVSFQNSDHRCIFT 
65 EITFZ 
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MNPIQRSWAYVSRKRLRSFlLFLILLVUJ^GISACLTLMKSNKTVESNLYKSL^^^FSImENCQT^ 
ENVSPELETVAKUCDKEAVTGEQSVERDDLSAADNNLVSLTALEDSSKDVTFTSSAFNUCEGRHLQKGDSK^ 
AKKNGLSLHDKiGLDAGQSESGKGQTVEFEIIGIFSGBCKQEKFTGLSSDFSENQVFTDYESSQTLLGNSEAQVSAARFYVE 
5 NPKEMDGLMKQVENU^LENQGYQVEKEhTKAFEQIJCDSVATFQTFLTIFLYGMLIAGAGAULVI^LWLRERVYEVGIL 
IJ^LGKGICSSIFLQFCLEVVLVSLGALLPAFVAGNArTTYLLQTIXASGDQASLQDTlJVKASSLS^ 
CLSVALCFLFLFRKSPKEILSSISZ 

MLHNAFAYVTRKFFKSrviFUILLMASLSLVGl^IKGATAKASQETFKNrrNSFSMQINRRVNQGTPRGAGNKGEDIKKI 
10 TENKAlESYVKRINAIGDLTGYDLIETPETKKNLTADRAKRFGSSUvlITGVNDSSKEDiCFVSGSYKLVEGEHLTNDDKDK 
ILLHKDLAAKHGWKVGDKVKLDSNIYDADNEKGAKETVEVTIKGLFDGHNKSAVTYSQELYEOTAn-DIHTAAKLYG^ 
TEDTAIYGDATFFVTADKMLDDVMKELNGiSGI^^VKSYTLV^CSSSNYPALEQSISGMYKMANU-FWGSLSFSVLLLALL 
I^LWINARRKEVGILLSIGLKQASILGQFn-ESILlAlPALVSAYFLANYTARAlGm^LANVTSGVAK 
GAEVDGFSKTLSSLDISIQTSDFIIIFVLALVLVVLVMALASSNLLRKQPKELLLDGE2 

15 

MSQDKQMKAVSPLLQRVINISSIVGGVGSLIFCIWAYQAGILQSKETLSAFIQQAGIWGPPLFIFLQILQTVVPnPGALTSV 
AGVFIYGHIIGTIYNYIGIVIGCAIIFYLVRLYGAAFVQSVVSKRTYDKYIDWLDKGNRFDRFFIFMMIWPISPADFLCMLA 
ALTKMSFKRYMTIIILTKPFTLVVYTYGLTYIIDFFWQMLZ 

20 MRNMWW^KHTyLRKVHSWSFFFMV^SPFLFLGISVGIGHL<5G5SMAK^WKVAVVTTVPSVAEGLKNVt^ 

ASAKEAIKEEiaKGYLTIDQEDSVLKAVYHGETSLENGIKFEVTGTLNELQNQLNRSTASLSQEQEKRLAQTIQFTEKIDE 
AKENKKRQTIAAGALGFFLYMILITYAGVTAQEVASEKGTKIMEVVFSSIRASHYFYARMMALFLVILTHIGnn^ 
AAVLLFKDLPFL^QSCILDHLCDAISLOTLLFIUSLFMYVVl^AFLGSMVSRPEDSGKALSPLMIUMGGFFGVTALGAA 
GDNLLLKIGSYIPFISTFFMPFRTINDYAGGAEAWISLArrviFAVVATGFIGRMYASLVLQTDDLGIWKTFKRALSYKZ 

25 

MTETIKLMKAHTSVRRiTCEQEIPQVDLNEILTAAQMASSWKNFQSySVlVVRSQEKKDALYELVPQEAIRQSAVFLL^^ 
GDLNRAEKGARLHTDTFQPQGVEGLLISSVDAALAGQNALLAAESLGYGGVnGLVRYKSEEVAELFNLPDYTYSVFG 
MALGVPNQHHDMKPRLPLENVVFEEEYQEQSTEAIQAYDRVQADYAGARATTSWSQRLAEQFGQAEPSSTRKNLEQK 
KLLZMLKLIAIVGTNSKRSTNRQLLQYMQKHFTDiCAEIELVEIKAJPVFNKPADKQVPAEILElAAKIEEADGVUGTPEYD 
30 HSIPAVLMSALAWLSYGIYPLLMKPIMrrGASYGTLGSSRAQtQLRQILNAPEIKANVLPDEFLLSHSLQAFNPSGDLVDL 
DVDOCLDAIFDDFRIFVKITEKLRNAQELLRKDAEDFDWENLZ 

MNTYQLNNGVEIPVLGFGTFKAKDGEEAYRAVLEALKAGYRHIDTAAIYQNEESVGQAIKDSGVPREEMFVTTKLWNS 
OQTYEQTRQALEKSIEKLGLDYLDLYLIHWPNPKPLRENDAWKTRNAEVWRAMEDLYQECKIRAIGVSNFLPHHLDAL 
35 LETATIVPAVNQVRLAPGVYQDQVVAYCREKGlLLEAWGPFGQGELFDSKQVQElAANHGKSVAQlAtAWSLAEGFLP 
LPKSVTTSRlQANLDCFGIELSHEERETLiCTlAVQSGAPRVDDVDFZ 

MRCKMLDPlAlQLGPlJ>iIRWYALCIVTGULAVYLTMKEAPRKKIIPDDlLDFILVAFPLAILGARLYYVIFRFDyYSQNLG 
EIFAlWNGGLAIYGGLrrGALVLYIFADRKLINTWDFLDL^APSVMlAQSLGRWGNFFNQEAYGATVDNLDYLPGnRDQ 
40 MYIEGSYRQPTFLYESLWNLLGFALILIFRRKWKSLRRGHITAFYLIWYGFGRMVIEGMRTDSLMFFGFRVSQWLSVVLI 
GLGIMrvIYQNRKKAPYYITEEENZ 

MGKLSSILLGTVSGAAL^LFLTSDKGKQVCSQAQDFLDDLREDPEYAKEQVCEKLTEVKEQATDFVLKTKEQVESGErr 
VDSILAQTKSYAFQATEASKNQLNNLKEQWQEKAEALDDSEEIVIDITEEZ 

45 

MKTKLIFWGSMLFLL^LSILLTrYLAWIFt'PMEIQWLNLTKRVYLKPETIQYNFHlLMNYLTNPFSQVLQMPDFRSSAAG 
LHHFAVVKNLFHLVQLVALVTLPSFYVFVNRiVKKDFLSLYRKSLLALVVLPVMIGLGGVUGFDQFFTLFHQILFVGD 
DTWLFDPAKDPVIMILPETFFLHAFLLFFALYENFFGYLYLKSRRKZ 

50 MTYHFTEEYDUVIGAGHAGVEASLAASRMGCKVLU^TINIEMLAFMPCNPSIGGSAKGIWREVDALGGEMAKTIDCT 
YIQMKMLOTGKGPAVRALRAQADKELYSKEMRKTVENQENLTLRQTMIDEILVEDGKVVGVRTATHQEYAAKAVIVT 
TGTALRGEIIIGDLKYSSGPNHSLASINLADNLKELGLEIGRFICTGTPPRVKASSINYDVTEIQPGDEVPNHFSYTSRDEDY 
VKDQVPCWLTYTNGTSHEIIQNNLHRAPMFTGVViCGVGPRYCPSlEDKIVRFADKERHQLFLEPEGRNTEEVYVQGLST 
SLPEDVQRDLVHSIKGLENAEMMRTGYAIEYDMVLPHQLRATLETKKISGLFTAGQTNGTSGYEEAAGQGIIAGINAAL 

55 laQGKPELILKRSDGYlGVMIDDLVTKGTIEPYRLLTSRAEYRLILRHDNADMRLTEMGREIGLVDDERWARFEIKKNQF 
DNEMKRLDSIKLKPVKETNAKVEEMGFKPLTDAVTAKEFLRRPEVSYQDVVAFIGPAAEDLDDKIIELIETEUCYEGYISJC 
AMDQVAKMKRMEEKRiPANIDWDDlDSlATEARQKFKLINPETIGQASRISGVNPADISILMVYLEGKNRSISKTLQKSKZ 

MTKQVLLVDDEEHILKLLDYHLSKEGFSTQLVTNGRKALALAETEPFDFILLDIMLPQLDGMEVCKRLRAKGVKTPIM 
60 MVSAKSDEFDKVLALELGADDYLTKPFSPRELLARVKAVLRRTKGEQEGDDSDNIADDSWLFGTLKVYPERHEVYKA 
NKLLS LTPKEFESDKNPFFEVFK VSK VT AQZ 

MTTFKDGFLWGGAVAAHQLEGGWQEGGKCISVADVWTAGRHGVARErTLGVLEGKYYPNHEAlDFYHRYKEDlALF 
AEMGFKCFRTSIAWTRIFPKGDELEPNEEGLQFYDNLFDECLKNGIEPVITLSHFEMPYHLVTEYGGWKNRKUDFFARF 
65 AEVVFKRYKDKVKYWMTFKEINNQANYQEDFAPFTNSGIVYEEGDNREAIMYQAAHYELVASARAVKIGHEINPDFQI 
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GCMUMCPIYPVTCNPKDILMAMKAMQKRYYFADVKVLGKYPEHIFKYWERKGISVDFTAQDKEDLLGGTVDVIGFS 
YYMSFAIDSHRENNPYFDYLETEDLVKNNYVKJ\SEWEWQ^DPEGLRYAL^^VF^^HYHLPLFrVENGFGA^ 
MVHDDYRIEYLGAHIREMKKAWEDGVDLMGYTPWGaDLVSAGTGEMRKRYGFlYVDKDDNGKGSYNRSPKKSFG 
WYKEVrSSNGESVEZ 

5 

MEKJQNGLFGFLENHVMGPMGKlJikQFKVVRAITAAGMAAVPFrrv'GSMFLVFSILPQAFSFWPr^ 
MVANYATMGSLSLYFVI^LAYELTKIYAEEEELNMNPLNGALLALMAFVNm^PQnFDGGMMKTVTSLKE^ 
WAMG^^WAIU=GTTGIF^AIIMA^VTVLIYRMCVKHNWVIKMPEAVPEGVSRGFTALVPGFVVAFV 
10 DIFKVUIPFGFVSNLTNSWIGLMnYLLTQLLWrVGIHGANIVFAFVSPlALANMAENAAGGHFAVAGEFSNMFVIAGGS 
GATLGLCLYIAFASKSEQIJCAIGRASWPALFNINEPUFGLPIIYNPALAIPFIU^PMVTATIYYVANSLNFK^ 
TPVGIGAFLGTADLRAVLVALVCAFAAFLVYLPFIRVYDQKLVKEEQGIZ 

MKKFYVSPIFPILVGLIAFGVLSTFIIFVNNNLLTVLILFLFVGGYVFLFKKLRVHYTRSDVEQIQYVNHQAEESLTALLEQ 
15 MPVGVMKLNLSSGEVEWFNPYAELILTKEDGDFDLEAVQTIIKASVGNPSTYAKLGEKRYAVHMDASSGVLYFVDVSR 
EQAITDELVTSRPVIGIVSVDNYDDLEDETSESDISQINSFVANFISEFSEKHMMFSRRVSMDRFYLFTDYTVLEGLMNDK 
FSVIDAFREESKQRQLPLTLSMGFSYGDGNHDEIGKVALLNLNLAEVRGGDQWVKENDETKNPVYFGGGSAASIKRT 
RTRTRAMMTAISDKIRSVDQVFVVGHKNLDMDALGSAVGMQLFASNVIENSYALYDEEQMSPDIERAVSHEKEGVTK 
LLSVKDAMGMVTNRSLULVDHSKTALTLSKEFYDLFTQTIVIDHHRRDQDFPDNAVrrYIESGASSASELVTELIQFQNS 
20 KKNRi^RMQASVLMAGMMLDTKNFTSRVTSRTFDVASYLRTRGSDSIAIQE^AATDFEEYREVNEULQGRiCLGSDV^^ 
AEAKDMKCYDTVVISKAADAMLAMSGIEASFVLAKOTC^FISISARSRSKLNVQRIMEELGGGGHFNLAA^ 
EAGEKLTEIVLNEMKEKEKEE2 

MKEKNMWKELLNRAGWILVFLLAVLLYQVPLVVTSILTLKEVALLQSGLIVAGLSIVVLALFIMGARiCTKLASFNFSFF 
25 RAKDLARLGLSYLVrVGSNlLGSILLQL^NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGIVPKKIFRGKENL 
GFVVGTIVFALLHQPSNLPSLUYGGMSTVLSWTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MKEKNMWKELLNRAG^^^LVFLLAVlJLYQVPLVVTSILTLKEVALLQSGLIVAGLS^VVLALFIMGARKTKLASFNF^ 
RAKDLARLGLSYLVIVGSNILGSILLQL^NETTTANQSQINDMVQNSSLISSFFLLALLAPICEEILCRGrVPKKIFRGKENL 
30 GFVVGTT^FALLHQPSNLPSLLIYGGMSTVI^WTAYKTQRLEMSILLHMIVNGIAFCLLALVVIMSRTLGISVZ 

MDTQKIEAAVKMIIEAVGEDANREGLQETPARVARMYQEIFSGLCQTAEEHLSKSFEIIDDNMVVEKDIFFHTMCEHHF 
LPFYGRAHIAYIPDGRVAGLSKLARTVEVYSKKPQIQERLNIEVADALMDYLGAKGAFV'VIEAEHMCMSMRGVRKPGT 
ATLTTVARGLFETDKDLRDQAYRLMGLZMKDLFLKRKQAFRKECLGYLRYVLNDHFVLFLLVLLGFLAYQYSQLLQH 
35 FPENHWPILLFVGITSVLLLLWGGTATYMEAPDKLFLLVGEEEIKLHLKRQTGISLVFWLFVQTLFLLLFAPLFLAMGY 
GLPVFLLYVLLLGVGKYFHFCQKASKFFTETGLDWDYVISQESlCRKQVLLRFFALFTQVKGlSNSVKRRAYLDFrLKAV 
QKVPGKIWQNLYLRSYLRNGDLFALSLRLLLLSLLAQVFIEQAWIATAVVVUFNYLLLFQLLALYHAFDYQYLTQLFPL 
DKGQKEKGLQEVVRGLTSFVLLVELVVGLITFQEKLALLALLGAGI-VLLVLYLPYQVKRQMQD2 

40 MRKSrVlJVADNAYUPLErriKSVLYHNRDVDFYILNSDIAPEWFKLLGRKMEVVNSTIRSVHIDKELFESYKTGPHINYA 
SYFRFFATEWESDRVLYLDSDIIVTGELATLFETDLKGYSIGAVDDVYAYEGRKSGFNTGMLLMDVAKWKEHSIVNSL 
LELAAEQNQVVHLGDQSILNIYFEDNWLALDKTYNYMVGIDrYHLAQECERLDDNPPnVHYASHDKPWNTYSISRLRE 
LWWVYRDLDWSEIAFQRSDLNYFERSNQSKKQVMLVTWSADIKHLEYLVQRLPDWHFHLAAPCDCSEELTSLSQYTN 
VTVYQNVLHSRIDWLLDDSIVYLDIOTGGEVFN\ATRAQESGKKIFAFDrrRKSMDDGLYDGrFSVERPDDLVDRM 

45 EIE2 

MTKIYSSIAVICKGLFTSFLLHYVLGSRIILPFVDLNTKDFLGGSTAYLAFSAALTGGNLRSLSIFSVGLSPWMSAM 
MFSFSKRLGLTSTSIEIQDRRKMYLTLLUVIQSLAVSLRLPVQSSYSAILVVLMOTILLIAGTFFLVWLSDLNASMGIGGSI 
VlLLSSMVLNIPQDVLETFQTVHIPTGIIVLLALLTLVFSYLLAUvIYRARYLVPVNKIGLHNRFKRYSYLEIMLNPAGGMP 
50 YMYVMSFLSVPAYLnLLGnFPNHSGUVALSKEFMVGKPtWVYVYISVLFLFSIlFAFVTMNGEElADRMKKSGEYIYCI 
YPGADTSRHNRLVLRFSVIGGLFhrviMAGGPMLFVLFDEKLLRLAMIPGLFMMFGGMtFTIRDEVKALRLNETYRPLIZ 

MSSt^DQELVAKTVEFRQRI^EGESLDDILVEAFAVVREADKRILGMFPYDVQVMGArVMHYGNTVAEMNTGEGKTLT 
ATMPVYLNAFSGEGVMVVTPNEYLSKRDAEEMGQVYRFLGLTIGVPFTEDPKKEMKAEEBaCLIYASDIIYTTNSNLGFD 

55 YLNDNLASNEEGKFLRPFNYVnDEIDDILLDSAQTPLnAGSPRVQSNYYAIIDTLVTTLVEGEDYIFKEEKEEVWLTTKG 
AKSAENFLGIDNLYKEEHASFARHLVYAIRAHKLFTKDKDYIIRGNEMVLVDKGTGRLMEMTKLQCGLHQAIEAKEHV 
KI^PETRAMASITYQSLFKMFNKISGMTGTGKVAEiCEFiETYNMSVVRIPTNRPRQRIDYPDNLYrrLPEKVYASLEYIKQ 
YHAKGNPLLVFVGSVEMSQLYSSLLFREGIAHNVLNANNAAREAQIISESGQMGAVTVATSMAGRGTDIKLGICGVAEL 
GGLrVIGTERMESQRIDLQIRGRSGRQGDPGMSKFFVSLEDDVlKKFGPSWVHKKYKDYQVQDMTQPEVLKGRKYRKL 

60 VEICAQHASDSAGRSARRQTLEYAESMNIQRDIVYKERNRLIDGSRDLEDVVVDIIERYTEEVAADHYASRELLFHFIVTN 
ISFHVKEVPDYIDVTDKTAVRSFMKQVIDKELSEKKELLNQHDLYEQFLRLSLLKAIDDNWVEQVDYLQQLSMAIGGQS 
ASQKNPrVEYYQEAYAGFEAMKEQlHADMVRNLLMGLVEVTPKGEIVTHFPZ 

MIGTFAAALVAVLANFIVPIErrPNSAOTELAPPDGlGQVLSNLLLKLVDNPVNALLTANYIRILSWAVIFGLAMREASKNS 
65 QELLKTIADVTSKIVEWIINLAPFGILGLVFKTISDKGVGSLANYGILLVLLVTTMLFVAPVVNPLIAFFFMRRNPYPLVW 
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NCUlVSGVTAFF^lSSATNIPVNMKLCHDLGLNPDTYSVSIPLGSTINMAGVAm^a.LTLAAV^^ 
VAAISSCDASGlAGGSLLLIPVACSLFGISNDL^IQrVGVGFVIGVIQDSCETALKSSTDVLFTAVAEYAATRKKZ 



5 MSISQRTTKLIlJVTClACUJ>LYFLm-SSAVSAGIIALLSLSDTRRSTLia^ 

I^FYVPI^YKMGWEIGTTPSTVLVSHU,VQESTSPDU,VNEFLLFAIGTGFALLVNLYMPSR£EEIQHYHTLVEEK^ 

LQRFKYYLSRGIX}RNR.^QLVAELDTIXKEALRLVYLDHSDHLFHQTDYHIHYFEMRQRQSRlLRNMAQQl>n*CH^ 

ESULAQLFSKUGQLSQTNPASDLLDEIERYLEVFRNRSLPKTREEFETRATLLOLLREAKTnQVKVDFYQKYRQZ 

10 MEIMSLAIAVFAVIIGLVIGYVSISAKMKSSQEAAELMLLNAEQEATNLRGQAEREADLLVNEAKRESKSLKKEALLEAK 
EEARKYREEVDAEFKSERQELKQIESRLTERATSLDRKDDNLTSKEQTLEQKEQSISDRAKNLDAREEQLEEVERQICEAE 
LERIGAL^QAEARDULAQTEE^a.TREIASRIREAEQEVKERSDKMAKDILVQAMQRIAGEYVAESTNSTVHLPDDTMKG 
RIIGR£GRN^RTFESLTGVDVlIDDTPEVVTLSGFDPIRR£URMTMEMLLKIX}RIHPARIEELVEKNRQEID^^IR£YG 
AAYEIGAP^^-HPDLMKIMGRLQFRTSYGQNVLRHSIEVAKIJ^GUMASELGENAALARRAGFLHDIGKAIDHEVEGSI^ 

15 IGMELARKYKEPPVVVNTUSHHGDVEAESVIAVIVAAADALSAARPGARSESLESYIKRLHDLEEIANGFEGVQTSFAL 
QAGR£IRIMVNPGKIKDDK\TILAHKVRKKIENNLDYPGNIKVTVIR£LRAVDYAKZ 

MMLKPSIDTLLDKVPSKYSLVILEAKRAHELEAGAPATQGFKSEKSTLRALEEIESG^m•IHPDPEGKREAVRRRIEE£KR 
RKEEEEKKOCEQIAKEKEDGEKIZ 

20 

MSAYQLPTVWQD£ASNQGAFTGLNRPTAGARFEQNLPKGEQAFQLYSLGTPNGVKVTILLEELLEAGFK£.\AYDLYKI 
AIMDGDQFGSDFVKLNPNSKIPALLDQSCTENVRVFESAHILLYLAEKFGAFLPSNPVEKVEVLNWLFWQAGAAPFLG 
GGFGHFFNYAPEKLEYPINRPTMEVKRQLDLLDKELAQKPYiAGNDYTlADlAlWSWYGQLVQGNLYQGSAKFLDASS 
YQNLVKWAEKIANRPAVKRGLEVTYTEIKZ 

UtSLrrSIIMFYVGFDVLRDTIQKII^REETVIDPLGATLGIISAAlMFVVYLYm-Rl^KKSNSNALKAA^^ 
GTAL^ILASSFNYPIVDKLVAlirrFHLKTAYDIHESSFSLSDGFDDRLLEDYQKAIMEIPKISKVKSQRGRTYGSNEYLDrr 
LEMNPDLSVFESHEIADQVESMLEERFGVFDTDVHIEPAPIPEDEILDNVYKKLLMRHQUDQGNQLEELLTDDFVYIRQ 
DGEQMDKEAYKTKKELNSAIKDIQn-SISQKTKLICYELDGIlHTSrWRRHETWQNIFHQETKKEZ 

30 

MTnCLVATDMDGTFLDGNGRFDMDRLKSLLVSYKEKGIYFAVASGRGFLSLEKLFAGVRDDUFlAENGSLVEYQGQDL 
YEATMSRDFYU^TFEKLKTSPYVDINKl-LLTGKKGSYVLDTVDETYLKVSQHYNENIQKVASLEDrrDDIFKFITNFTEE 
TLEDGEAWVNE^^VPGViCAi^^GFESID^VLDYVDKGVAIV£LVKKLGr^MDQV^^AFGDNLNDLHMM 
NARPEZLELAKTVIGHHKERSVIAYMEGL2 

35 

MADIKLIALDLDGTLLTTDKRLTDRTKETLQAARDRGIKVVLTTGRPLKAMDFFLHELGTDGQEDEYTITFNGGLVQK 
OTGEILDKTVFSYDDVARLYEETEKLSLPLDAISEGTVYQIQSDQESLYAKFNPALTFVPVDFEDLSSQMTYNKCVTAFA 
QEPLDAAIQKISPELFDQYEIFKSREMLLEWSPKNVHKATGLAKLISHLGIDQSQVMACGDEANDLSMIEWAGLGVAM 
QNAVPEVKAAANVVTPMTKDEEAVAWAIEEYVLKEN2 

MESLLILLLIANLAGLFLnVQRQDRQEKHLSKSLEDQADHLSDQLDYRFDQARQASQLDQKDLEVVVSDRLQEVRIELH 
QGLTQVRQEMTDNLLQTRDKTDQRLQALQESNEQRLEQMRQTVEEKLEKTLQTRLQASFETVSKQLESVNRGLGEMQ 
TVARDVGALNKVLSGTKTRGILGELQLGQIIEDIMTPAQYEREYATVENSSERVEYAIKLPGQGDQEYVYLPIDSKFPLA 
DYYRLEEAYETGDKDEIERCRKSLLASVKRFARDIRNKYiAPPRTTNFGVLFVPTEGLYSEIVRNPVFFDDLRREEQIIVA 
45 GPSTL^ALLNSLSVGFKTLNtQKSADHISKTLASVKTEFGKFGGILVKAQKHLQHASGNIDELLNRRTlAIERTLRHIELSE 
GEPALDLLHFQENEEEYEDZ 

MKISHMKKDELFEGFYLIKSADLRQTRAGKNYLAFTFQDDSGEIDGKLWDAQPHNIEAFTAGKVVHMKGRREVYNNT 
PQVNQITLRLPQAGEPNDPADFKVKSPVDVKEIRDYMSQMIFKIENPVWQRlVRNLYTKYDiCEFYSYPAAKTNHHAFET 
50 GLAYHTATM'VRLaDaISEVYPQLNKSLLYAGIMLHDLAKVIELTGPDQTEYTVRGNLLGHIALIDSEITKTVMELGIDDT 
KEEVVLLRHVII^HHGLLEYGSPVRPRIMEAEnHMlDNLDASMMMMSTALALVDKGEMTNKIFAMDhfRSFYKPDLDZ 

MSEKAKKGFKMPSSYTVLLIIlAIMAVLTWFIPAGAHEGtYETQPQNPQGIWDVLMAPIRAMLGTHPSEGSLIKETSA^ 
VAFnLMVGGFLGIVNKTGALDVGlASrVKKYKGREKMLlLVLMPLFALGGTTYGMGEETMAFYPLLVPVMMAVGFDS 
55 LTGVAnLLGSQIGCLASTLNPFATGlASATAGVGTGDGIVLRLIFWVTLTALSTWFVYRYADFCIQKDPTKSLVYSTRKED 
LKHFNVEESSSVESTLSSKQKSVLFLFVLTHLMVI^HPWTDLGVTIFDDFNTWLTGLPVIGNIVGSSTSALGTWYFPEG 
AMLFAFMGILICVIYGLKEDKIISSFMNGAADLLSVALIVAlARGIQVIMNDGMrrDTILNWGKEGLSGLSSQVFIVVTYIF 
YLPMSFLIPSSSGLASATMGIMAPLGEFVNVRPSLin"AYQSASGVLNLlAPTSGIVMGAlJ»LLGRINIGTWWKFMGKLVVA 
HVVTIALLLLGTFLPFLZ 

60 

MSNSFVKLLVSQLFANUVDIFFRNTTIIANIYIISKSVIATSLVPIUGISSFVASLLVPLVTKRIJVLNRVLSL^QFGKTILI^^ 
VGMFI^MQSVAPLVTYLFWAISILDGFAAPVSYAIVPRYATDLGKANSALSMTGEAVQLIGWGLGGLLFATIGLLPTT 
CINLVLYIISSFLMLFLPNAEVEVLESETNLEIIXKGWKLVARNPRLRLFVSANLLEIFSNTIWVSSIILVFVTELLNKTESY 
WGYSNTAYSlGinSGLIAFRl^EKFLAAKWEPQLFTPNLKTIQNPClJSLDPGWFLFSPNGCFLLDKKEFPLYGlSVEKNTK 
65 RKETHMNSLPNHHFQNKSFYQLSFDGGHLTQYGGLIFFQELFSQLICLKERISKYLVTNDQRRYCRYSDSDILVQFLFQLL 
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TGYGTDYACKELSADAYFPKlXEGGQLASQITLSRFI^RTDEETVHSLRCLNLELVEFFLQFHQLNQLrVDID 
GKQEGVAYNAHYRAHGYHPLYAFEGCTGYCFNAQLRPCNRYCSEEADSFrrPVLERFHQU,FRMDSGFATPKLYDLIE 
KTGQYYl^LKKmVLSRLGDLSLPCPQDEDLTILPHSAYSETLYOAGSWSHKRRVCQFSERKEGNLFYDVISLVTNMTS 
GTSQDQFOLYRGRGQAENFIKEMKEGFFGDKTDSSTUK^JEVRMMMSCIAYNLYLFLKHLAGGDFQTLTIKRFRHLFL 
5 HVVGKCVRTGRKQLLKLSSLYAYSELFSALYSRJRKVNLNLPVPYEPPRRKASLMMHZ 

MMEFFQQLPHLEPYGNPQYFVYVlAATLPIFIGU^FICKRPAWYEVLVSLFRVTMLVGGCTNQIJSu^LGIYL^^ 
YKHYRKSKTCKWVFYLVSFLSLLPIIFVKVQPAINGTQSLLGFLGISYLTFRSVGIVIELRIXjVIKDFTLWEFLRFLLF 
FSSGPIDRFKRFNENYQAIPERDELMDMLDESVRYIMWGFLYKFILAHVLGETLLPPLKNLALQSGGFFNLYALAVMYT 
10 FGLELFFDFAGYSMFAUVJSNLMGIRSPINFNKPFLSRDLKEFWNRWHMSi-SFWFRDFVFMRMVMVLTRKKVFKNRN 
VTSSMAYIVNMLIMGFWHGVTWYYIAYGLFHGLGLVINDAWVRXKKTLNKERKKAGKAAIJ>ENRWI^ 
WMLSFLIFSGFLNNLWFKKZ 

MLKRLWMIFGPVLIAGLLVFLLIFFV^PTEMHHNLGAEKRSAVATTIDSFKERSQKVRAL^DPWRFVPFFGSSEWLRFD 
15 GAHPAVLAEKYNRSYRPYLLGQGGAASLNQYFGMQQMLPQLENKQVVYVtSPQWFSKNGYDPAAFQQYFNGDQLTS 
FUCHQSGDQASQYAATRLLQQFPNVAMKDLVQKLASKEEi^TADNEMIELLARFNERQASFFGQFSVRGYVNYDKHV 
AKYLKILPDQFSYQAIEDVVKADAEKOTSNNEMGMENYFYNEQIKKDLiCKLKDSQKSFTYLKSPEYNDLOLVLTQFSK 
SKVNPIHIPPVNKKWMNYAGLREDMYQQTVQKIRYQLESQGFTNIADFSKDGGEPFFMKDTIHLGWLGWLAFDKAVD 
PFLSNPTPAPTYHLNERFFSKDWATYDGDVKEFQZ 

MEKNLKALKQTTDQEGPAIEPEKAEDTKTVQNGYFEDAAVKDRTL^DYAGNWQSVYPFLEDGTFDQVFDYKAKLTG 

KN^QAEYK>YYTKGYHTDVTKINITDOTMEFVQGGQSKKYTYKYVGKKrLTYKiCGNRGVRFLFEATDADAGQF^ 

QFSDHNVAPVKA£HFH!FFGGTSQEALFEEMDNWPTYYPDNLSGQEIAQEMLAH2 

25 MKDGHLLAHHIRLLNGRIFQKLLSQDPEALYRGEQGKII^VLWNSETGCATATDlALATGLANOTLTTMrKKLEEQKL 
VIVSPCGKDKRKKYLVLTELGBCSQFCEVGHRVSQfCLDTIFYKGFSEEEIHQFEGFQERILANLKEICGNEVZ 

MTNLIATFQDRFSDWLTALSQHLQL^LLTLLLAILLAIPLAVFLRYHEKLADWVLQIAGIFQTIPSLALLGLFIPLMGIGTL 
PALTALVIYAIFPILQ^^•rTGLKGIDPNLQ£AGlAFGMTR^VERLKKFEIPLAMPVIMSGIRTAAVUIGTATLAALIGAGGL 
30 GSFILLGIDRNNASLILIGALSSAVLAIAFNFLLKVMEKAKLRTIFSGFALVALLLGLSYSPALLVQKEKENLVIAGICIGPEP 
EILANMYKIXIEEha*SMTATVKPMFGCTSFLYEALKKGDIDrYPEFrGTVTESLLQPSPKVSHEPEQVYQVARDGUK^ 
HLAYLKPMSYQNTYAVAVPKfCIAQEYGLKTISDLKKVEGQLKAGFTLEFNDREDGNKGLOSMYGLNLNVATIEPALRY 
QAIQSGDIQrTDAYSTDAELERYDLQVLEDDKQLFPPYQGAPLMKEALLKKHPELERVLNTLAGKrTESQMSQLNYQVG 
VEGKSAKQVAKEFLQEQGLLICKZ 

MMHTYLQiCKIENIKTTLGEMSGGYRRMVAAMADLGFSGTMKAlWDDLFAHRSFAQWIYLLVLGSFPLWLELVYEHRI 
VDWIGMICSLTGIICVIFVSEGRASNYLFGLINSVIYLILALQKGFYGEVLTTLYFTVMQPIGLLVWIYQAQFBCKEKQEFV 
ARKLDGKGWTKYLSISVLWWLAFGFIYQSIGANRPYRDSITDATNGVGQILMTAVYREQWIFWAATNVFSIYLWWGES 
LQIQGKYUYLINSLVGWYQWS^CAAKQ^a•DLLNZ 

40 

_ MRNMJCAKYAVWVAFFLNLTYAIVEFIAGGVFGSSAVLADSVHDLGDAIAIGISAFLETISNREEDNQYTLGYKRFSLLG 
ALVTAVILVTGSVLVILENVTKILHPQPVNDEGILWLGIIAITINLLASLVVGKGKTKNESILSLHFLEDTLGWVAVILMAI 
VLRJTDWYILDPLLSLVISFFELSKALPRFWSTLKIFLDAVPEGLDIKQVKSGLERLDNVASLNQLNLWTMDALEKNAiV 
HVCLKEMEHMETCKESIRiFLKDCGFQNITIEIDADLETHQTHKRKVCDLERSYEHQHZ 

45 

MIEYKNVALRYTEKDVLRDVNLQIEDGEFMVLVGPSGSGKTTMLiCMINRLLEPTDGNIYMDGICRIKDYDERELRLSTG 
YVLQAlALFPNLTVAENULIPEMKGWSKEEITKKTEELLAKVGLPVAEYGHRLPSELSGGEQQRVGrVRAMIGQPKIFL 
MDEPFSALDAISRKQLQVLTKELHKEFGMTTIFVTHDTDEALKLADRIAVLQDCEIRQVANPETILICAPATDFVADLFG 
GSVHDZ 

50 

MSAVAISAMTKVMQETHGNPSSIHGHGRQAGKLLREARQEl^QLLRTKPQHIFFTSGGTEGNNriTIIGYCLRHQEQGKH 

rnTAIEHHAVLETIDYLVQHFGFEATnQPENQErrAQQIQKALRDDTILVSTMFVNNETGNLLPlAEIGQlLKQHPAAYH 

VDAVQAlGKrPIHSEELGIDFLTASAHKFHGPKGIGFLYASSMDFDSYLHGGDQEQKKRAGTENLPAlVGMVAALtCEDL 

, EKQEEHFQHVQNLETAFLAELEGIQYYLNRGKHHLPYVLNIGFPGQKNDLLLLRLDLAGISISTGSACTAGVVQSSHVLE 

D5 AMYGANSERLJCESLRISLSPQNTVEDLQTLAKTLKEIIGGZ 



MLFKLSKEKIELGLSRLSPARRIFLSFALVILLGSLLLSLPFVQVESSRATYFDHLFTAVSAVCVTGLSTLPVAHTYNIWG 
QIICLLLIQiGGLGLMTFIGVFYIQSKQKLSLRSRATIQDSFSYGETRSLRKFVYSIFLTTFLVESLGAlLLSFRLIPQLGWGR 
50 GLFSSIFLAISAFCNAGFDNLGSTSLFAFQTDLLVNLVIAGLIITGGLGFMVWFDLAGHVGRKKKGRLHFHTKLVLLLTI 
GLLLFGTATTLFLEWNNAGTIGNLPVADKVLVSFFQTVTMRTAGFSTIDYTQAHPVTLLIYILQMFLGGAPGGTAGGLK 
rrTFFVLLVFARSELLGLPHANVARRTlAPRTVQKSFSVFIlFLMSFLIGLILLGITAKGNPPFlHLVFETISALSTVGVTANL 
TPDLGKLALSVIMPLMFMGRIGPLTLFVSLADYHPEKKDMIHYMKADISIGZ 

65 
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MSDRTIGILGLGIFGSSVLAAL^QDMNIlAlDDHAERINQFEPVLARGVIGDrrDEELLRSAGIDTCDTVVVATGENLESS 

VI^VMHC^CSLGVI^VIAKVKSQTAKKVLEKIGADSVISPEyEMGQSLAQTILFHNSVDVFQLDK^rVS^VEMKIPQSWA 

QSLSiCLDLRGKYNLNILGFREQENSPLDVEFGPDDLLIC^DTYlLAVINNQyLDTLVALNSZ 

MKLL^IAISSYNAAAYLHYCVESLVIGGEQVGIUINDGSQDQTQElAECLASKYPNIVRAIyQE^^KCHGCAVNRGLVEAS 
GRYFKVVDSDDWVDPRAYLKILETLQELESKGQEVDVFVTNFVYEKEGQSRKKSMSYDSVLPVRQIFGWDQVGNFSK 
GQ YTMMHSLIYRTDLLRASQFZ 



MKFNPNQRYTRWSIRRI^VGVASVVVASGFFVLVGQPSSVRATCLNPTPGQVLPEETSGTKEGDLSEKPGDTVLTQAKP 
EGVTGNTNSLPTPTERTEVSEETSPSSLDTLFEKDEEAQKNPELTDVLKETVDTADVDGTQASPAETTPEQVKGGVKEN 
TKDSIDVPAAYLEKABGKGPFTAGVNQVIPYELFAGDGMLTRLLLKASDNAPWSDNGTAKNPALPPLEGLTKGKYFYE 
VDLNG^^^VGKQGQALIDQLRANGTQTYKATVKVYGNKIX5KADLTNLVATKWDININGLVAKETVQKAVADN^ 
15 IDVPAAYLEKAKGEGPFTAGVNHVIPYELFAGIXjMLTRLLLKASDFCAPWSDNGDAKNPALSPLGENVICTKGQYFYQV 
ALDGNVAGKEKQALIDQFRANGTQTYSATVNVYGNKEXSKPDLDNIVATKKVTININGLISKETVQKAVADrA^KDSlDV 
PAAYLEKAKGEGPFTAGVNHVIPYELFAGDGMLTRLLLKASDKAPWSDNGDAKNPALSPLGEOTKTiCGQyFYQLALD 
GWAGKEKQALIDQFRANGTQTYSATVNVYGNKDGKPDLDNrVATKKVTININGLISKETVQBCAVADNVKTVSMFQQP 
TZ 

20 

MKLK^YILVGYIISTLLTILVVFWAVQKMLIAKGEIYFLLGNrrrVASLVGAGISLFLLLPVFTSLGKLKEHAKRVAAK^ 
SNLEVQGPVEFQQLGQTFNEMSHDLQVSFDSLEESEREKGLMlAQLSHDIKTPrrSIQATVEGrLDGIIKESEQAHYLATIG 
RQTERLNKLVEELNFLTLNTARNQVETTSKDSIFLDKLUECMSEFQFLIEQERRDVHLQVIPESARIEGDYAiCLSRILVN 
25 LVDNAFKYSAPGTKLEVVAKLEKDQLSISVTDEGQGIAPEDLENIFKRLYRVETSRNMKTGGHGLGLAIARELAHQLGG 
EITVSSQYGLGSTFTLVLNLSGSENKAZ 

MFGQTAQHGLTNSLKDFWIFLLNIGPQLAFFCQMLRCSRSVEQGTGNHRREFNMIQQIFSHFGMTHLCQIKLVYQESID 
30 LEIXVNALNHHLLIDRl^VLTPNQITIEIDRQIVHGLDLLKGRKDKEIIDIKSMFRQLELASTQQrCPINQRVHHGILAFGEIS 
DLVPAKNLPNRQD2 

MEHLATYFSTYGGAFFAALGIVLAVGLSGMGSAYGVGKAGQSAAALLKEQPEKFASALILQLLPGTQGLYGFVIGlLrw 
35 LQLTPELPLEKGVAYFFVALPlArVGYFSAKHQGNVAVAGMQILAKRPiCEFMKGAILAAMVETYAlLAFVVSFlLTLRVZ 

MLKSEKQSRYQMLNEELSFLLEGETNVLANLSNASALIKSRFPNTVFAGFYLFDGKELVLGPFQGGVSCIRIALGKGVC 
- GEAAHFQETVIVGDVTTYLNYISCDSLAKSEIVVPMMKNGQLLGVLDLDSSErEDYDAMDRDYLEQFVAILLEKTAWD 
40 FTMFEEKSZ 

MSVLEIKDLHVEIEGKEILKGVNLTLICTGEiAAIMGPNGTGKSTLSAAIMGNPNYEVTKGEVLFDGVNILELEVDERAR 
MGLFLAMQYPSEIPGCTNAEFLRAAMNAGKEDDEKISVREFITKLDEKMELLNMKEEMAERYLNEGFSGGEKKRNEIL 
45 QLLMLEPTFALLDEIDSGLDIDALKVVSKGVNAMRGEGFGAMIITHYQRLLNYrrPDVVHVMMEGRWLSGGPELAAR 
LEREGYAKLAEELGYDYKEEL2 

MPYKRQRSFSMALSKLDSLYMAVVADHSKNPHHQGFCLEDAEQISLNNPTCGDVlNLSVKFDAEDRLEDIAFLNSGCnS 
50 TASASMMTDAVLGKTKQEILELATIFSEMVQGQKDERQDQLGDAAFL5GVAKFPQRIKCATLAWNALICKTIENQEKQ2 

MKIQDLLRKDVMLLDLQATEKTAVIDEMIKNLTDHGYVTDFETFKEGILAREALTSTGLGDGIAMPHSKNAAVKEATV 
LFAKSNKGVDYESLDGQATDLFFMIAAPEGANDTHLAALAELSQYLMiCDGFADICLRQATSADQVIELFDQASEKTEEL 

DD VQAPANDSGDFIVAVTACTTGlAHTYMAQEALQKVAAEMGVGIKVETNGASGVGNQLTAEDIRKAKAillAADlCAVEM 
DRFDGKPUNRPVADGIRKTEELINI^LSGDTEVYRAANGAKAATASNEKQSLGGALYKHLMSGVSQMLPFVIGGGIM! 
ALAFLIDGALGVPNENLGNLGSYHELASMFMKIGGAAFGLMLPVFAGYVAYSIAEKPGLVAGFVAGAIAKEGFAFGKIP 
YAAGGEATSTLAGVSSGFLGALVGGFlAGALVLAlKKYVKVPRSLEGAKSrLLLPLLGTILTGFVMLAVNIPMAAINTAM 
NDFLGGLGGGSAVLLGIVLGGMMAVDMGGPVNKAAYVFGTGTLAATVSSGGSVAMAAVMAGGMVPPLAIFVATLLF 

OU KDKFTKEERNSGLTNriMGLSFITEGAlPFGAADPARAIPSFILGSAVAGGLVGLTGIKLMAPHGGIFVlALTSNALLYLVS 
VLVGAIVSGVVYGYLRKPQA2 



MANKhrrSTTRJUU>SKAELERKEAIORMLISLGlAILLIFAAFKLGAAGrTLYNLIRLLVGSLAYLAlFGLLrYLFFFKWrRK 
QEGLl^GFFTIFAGLLLIFEAYLVWKYGLDKSVLKGTMAQVVTDLTGFRTTSFAGGGLIGVALYIFTAFLFSNIGTYFIGS 
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ILILVGSLLVSPWSVYDIAEFFSRGFAKWWEGHERRKEERFVKQEEKARQKAEKEARLEQEETEKAIXDL 

ILTEEAVQNLPPIPEEKWVEPEIILPQAELKFPEQEDDSDDEDVQVDFSAKEALEYKLPSLQLFAPDKPKDQSKEKKrVRE 

NIKILEATFASFGIKVTVERAEIGPSVTKYEVKPAVGVRVNRISNI^DDLALAIAAKDVRIEAPIPGK^ 

SFRELWEQSQTKAEKFLEIPLGKAVNGTARAFDl^KMPHLLVAGSTGSGKSVAVNGIIASILMJCARPDQVKFMMVDPK 

MVEJL3VYNDIPHLLIPVVTNPRKASKALQKVVDEMENRYELFAKVGVRNIAGFNAKVEEFNSQSEYKQIPLPFIVVfVDE 

LAX>LMMVASKEVEDAnRLGQFCARAAGIHMILATQRPSVDVlSGLIKANVPSRVAFAVSSGTDSRTILDENGA£KLLGRG 

DMLFKPIDENHPVRLQGSFISDDDVERIVNFIKTQADADYDESFDPGEVSENEGEFSDGDAGGDPLFEEAKSLVIETQKA 

SASMIQRRLSVGFNRATRLMEELEIAGVIGPAEGTKPRKVLQQ2 



MSYFKKYKFDKSQFKLGMRTFKTGIAVFLVLLIFGFFGWKGtQIGALTAVFSLRESFDESVHFGTSRILGNSIGGLYALV 

FFLLOTFFHEAFWVTLWVPICTMLTIMTNVAMNNKAGVICGVAAMLirrLSJPSGETILW 

YDIDRIRLFLEKKEKZ 



MNKSEHRHQLrRAUTK^^CIHTQAELQALL^ND!QVTQATLSRD^KNMNl^KVREEDSAYYVLNNGSISKWE 
MEDALVWMRPVQHQVLLKTLPGLAQSFGSUDTLSFPDAIATLCGNDVCLIICEDADTAQKCFEELKKFAPPFFFEEZ 



MKSlKLNALSYMCmVLNtlFPItTGTYVAl^VLDRTDyGYFNSVDTILSFFLPFATYGVYNYGLRAISNVKDNKKDLNRT 

FSSLFYLCIACTILTTAVYILAYPUTTDNPIVKKVYLVMGIQLUQIFSiEWVNEALENYSFLFYKTAFrRILMLVSIFLF^ 

NEHDrvVYTLVMSLSTLIhTYLISYFWlKRDIKLVKIHLSDFKPLFLPLTAMLVFANANMLFTFLDRLFLVKTGIDVNVSY 

YTUQRIVTVIAOVVTGAIGVSVPRLSYYLGKGDKEAYVSLVNRGSRIFNFFIIPLSFGLMVLGPNAILLYGSEKYIGGGIL 

TSLFAFRTni^DTILGSQILFTNGYEKRrrVYTVFAGLLNLGLNSLLFFNHP/APEYYLLTTMLSETSLLVFYnFtHRKQL 

mLGHIFSYTVRYSI^SLSFVAlYFUNFVYPVDMVINLPFLIOTGLIVLLSAISYlSLLVFTKDSIFYEFLNHVLAUChnK^ 



MELFMKrrNYEIYKLKKSGLTNQQILKVLEYGENVDQELLLGDIADISCCRNPAVFMERYFQIDDAHLSKEFQKFPSFSIL 
DDCYPWDLSEIYDAPVU-FYKGNLDLLKFPKVAWGSRACSKQGAKSVEKVIQGLENELVIVSGLAKGIDTAAHMAAL 
QNGGiCTlAVlGTGLDVFYPKANKRLQDYIGNDHLVLSEYGPGEQPLKFHFPARNRIIAGLCRGVIVAEAJCMRSGSLITCE 
RAMEEGRDVFAIPGSILDGLSDGCHHLIQEGAKLVTSGQDVLAEFEF2 



MfCQLTVEDAKQlELEILDYIDTLCKKHNINYIINYGTLIGAVRHEGFIPWDDDIDLSMPREDYQRFINIFQKEKSKYKLLS 
LETDKNYFNNFImDSTTKIIDTR^^^KTYESGIFIDIFPIDRFDDPKVIDTCYKLESFKLI^FSKHKN^VYKDSLLKDWIRT 
AFWLLLRPVSPRYFANKIEKEIQKYSRENGQYMAFIPSKFKEKEVFPSGTFDKTIDLPFENLSLPAPEKFDTILTQFYGDY 
MTLPPEEKRFYSHEFHAYKLED2 



MTKINHLmQMKDLRDLVSDLTMTIQDGEKVAnGEEGNGKSTLLKILMGEALSDFTIKGNIQSDYQSLAYrPQKVPEDL 

KKKTLHDYFFLDSIDLDYSILYRLAEELHFDSNRFASDOEIGNLSGGEALKIQLIHELAKPFEILFLDEPSNDLDLETVDW 

LKGQIQKTRQTVinSHDEDFLSETADTIVHLRLVKHRKEAETLVEHLDYDSYSEQRKANFAKQSQQAANNQRAYDKT 

MEKHRRVKQhrVETALRATKDSTAGRLLAKKMKTVLSQEKRYEKAAQSMTQKPLEEEQIQLFFSDIQPLPASKVLVQLE 

KENI^IDDRVLVQKLQLTVRGQEKIGIIGPNGVGKSTLLAKLQRLLNDKREISLGFMPQDYHKKLQLDLSPIAYLSKTGE 

KEELQKIQSHLASLNFSYPEMQHQIRSLSGGQQGKLLLLDLVLRKPNFLLLDEPTRNFSPTSQPQIRKLFATYPGGLITVS 

HDRRFLKEVCSIIYRMTEHGLKLVNLEDLZ 



MKPKTFYNLLAEQNLPLSDQQKEQFERYFELLVEWNEKINLTAITDKEEVYLKHFYDSIAPILQGLIPNETIKLLDIGAGA 
GFPSLPMKILYPELDVTIIDSLNKRINFLQLLAQELDLNGVHFYHGRAEDFAQDKNFRAQYDFVTARAVARMQVLSELT 
IPYLKVCGKLLAUCASNAPEELLEAKNALNLLFSKVEDNl^YALPNRDPRYITVVEKKKETPNKYPRKAGMPNKRPLZ 



MSIKUAVDIDGTLVNSQKEITPEVFSAIQDAICEAGVKVVIATGRPIACVAKLLDDLQLRDEGDYVVTFNGALVQETATG 
HEIISESLTYEDYLDMEFLSRfaGVHMHArrKDGIYTANRNiGKYTVHESTLVSMPIFYRTPEEMAGKEIVKCMFIDEPEI 
LDAAIEKIPAEFYERYSINKSAPFYLELUCKWDKGSAITHLAEKLGLTKDETMAIGDEENDRAMLEVVGNPVVMENGN 
PEIKKIAKYITKTNDESGVAHAIRTWVLZ 



MTWULGVlAUVIFVrVSYNGLVKNRMQTKEAWSQIDVQLKRRNDLLPNLlETVKGYAKYEGSTLEKVAELRNQVAAA 
TSPAEAMKASDALTRQVSGIFAVAESYPDLKASANFVfCLQEELTNTENKISYSRQLYNSVVSNYNVKLETFPSNIlAGMF 
GFKAADFLQTPEEEKSVPKVDFSGLGDZ 




83 



MLFDQIASNKRKTWILLLVFFLLLALVGYAVGYLBI^GLGGLVlALnGFtYAL^MIFQSTEIVMSMNGAREVDEQTAPD 
LYHVVEDMALVAQIPMPRVFlIDDPALNAFATGSNPQNAAVAATSGLLAIMNREELEAVMGHEVSHrRNYDIRISTlAV 
ALASArrMLSSMAGRMMWWGGAGRRJ^DDDRIXjNGLElIMLVVSLLAIVLAPLAATLVQLAlSRQREFLA 
RNPQGMIKALDKLDNSKPMSRHVDDASSALYINDPKKGGGFQ!CLFYTHPPISERIERL.KQMZ 



MKLNIQEIRKQSEGLNFEQTLDLVDDLRARNQEILDVKDILAVGKVQYEDRMYFLDYQLSYTrVLASSRSMEPVELVES 
YPVTEVFMEGATNQLDQEVLDDDLVLPIENGELDLAESVSDNtLLN!PlKVLTAEE£AGQGFISGNDWQ[\rrEEEYQAQ 
KAVKKEENSPFAGLQGLFDGDEZ 

10 

MKRQLALWFSGGQDSTTCLFWVMQHYETVEAVTFAYGQRHHLEIQrrREIAKEQGIRHHILDMSLLGQITAQPDFATI 
HISYIPDKLCVESKSLKLYLFSYRNHGDFHENCINTIGKDLVNLLDPRYLEVWGKFTPRGGISIDPYYNYGKQCTKYEGL 
AEQRLFQHDLYPEKIDNR2 



MTETVEDKVSHSrrGLDILKGrVAAGAVISGTVATQTKVFTKESAVLEKTVEKTDAUVTNDTVVLGTISTSNSASSTSLSA 
SESASTSASESASTSASTSASTSASESASTSASTSISASSTVVGSQTAAATEATAKKVEEDRKKPASDYVASVTNVNLQSYA 
KRRKRSVDSI£QLLASlKNAAVFSG^mVNGAPAINASLNIAKSETKVYTCEGVDSVYRVPIYYXLKVTNDGSKLTFTYT 

20 VTYVNPKTNDLGNISSMRPGYSrYNSGTSTQTMLTLCSDLGKPSGVKNYITDKNGRQVLSYrn'STMTTQGSGYTWGNG 
AQMNGFFAiCKGYGLTSSWTVPrrGTDTSFTFTPYAARTDRIGINYFNGGGKVVESSTTSQSLSQSKSL^VSASQSASASi^ 
TSASASASTSASASASTSASASASTSASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASE 
SASTSASASASTSASESASTSASASASTSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTST 
SASASTSASESASTSASASASTSASASASTSASASASTSASASTSASESASTSASASASTSASASASTSASASASTSASASASTS 

25 ASVSASTSASASASTSASASASTSASESASTSASASASTSASASASTSASASASTSASESASTSASASASTSASESASTSASASA 
STSASASASTSASGSASTSTSASASTSASASASTSASASASISASESASTSASESASTSTSASASTSASESASTSASASASTSASA 
SASTSASASARQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQHQPVPRLQRQPVRQLQQVPVLQSQHQQVLQ 
PQHRQVPRLQQAHQHLNQRRQAPQLQQVPVRQPQRRQVRQPQQVLVHQLQHQRVHRLRRQPVHQSQQVPVRQLPHQ 
QVPRLQQAPVRRLQQVLAPQPQPQPVRQPQQVSQRLNRHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLNRHQRVRPL 

30 QQVLAPQPQRQQVHRLQHQRVRPLQQVLAPQPQRQQVHRLQRQRVRLSQHQRVRQPQQAHQLLNLHQPVRQPQHRQ 
APQLQQVPVRQPQRRQVRRLQQVPVRQPQQVPVRQPQRRQVRRPQPVHLNRHQPVRQPQQVLVHQLQHQRVHRLQH 
QPVHQSQQVPVRQFRINKCLGFSKYZ 



35 MGVETWFYSSICWI^IGLGSVWKFPYMTAANGGGGFLLIFLISTILIGFPLLLAEFALGRSAGVSAIKTFGKLGKNNECYN 
FIGWIGAFALFILLSFYSVIGGWILVYLGIEFGKLFQLGGTGDYAQLFTSIISNPAJALGAQAAFILLNIFIVSRGVQKGIERA 
SKVMMPLLFIVFVFIIGRSLSLPNAMEGVLYFUCPDFSKLTSTGLLYALGQSFFALSLGVTVMLTYASYUDKKTNLVQSG 
ISIVAMNISISIMAGLAIFQARSPFNIQSEGGPSLLFTVLPQLFDKMPFGTIFYVLFLLLFLFATVTFSVVMLEINVDNITNQD 
NSKRAKWSVILGILTFVFGIPSALSYGVMADVHIFGKTFFDAMDFLVSNLLMPFGALYLSLFTGYIFKKALAMEELHLD 

40 ERAWKQGLFQVWLFLLRFFVSSFQSSSLWSSLPNLCNQKGLE2 



MLKXWQLKDVILI^FLSIFFGGVFVGSGYVYNtl^LLLTPLGLQAFANEILFGLWCMAAPlAAlFVPRVGSATIGEVLAA 
LAEVLYGSQFGLGALLSGFVQGLGSEFGFIVTKNRYESWLSLTANSJGITLVSFVYEYIKLGYYAFSLPFVLSLLVVRFISV 
45 YFFCTILVRAIVKLYHQFATGGKAZ 



MVKVATQTPIISLFLLILSLETSRPSIALTLSVVAFCILFMLYYRilFKMLAWMIILAlLPSFANYWAVQLHGDASQAVML 
GTRAFVTVCIGLVFVSSVSLKELLLYLAQKGLSRSWSYALIVVFNSFPLIQQEIKSLKEACLLRGQELHFWSPLIYSKVLM 
50 TVFRWRHLYLRALSAHGYDEHAQLKNSYRTFYIPKKTKUYLLFFLLLQTSLFL2 



MRKHQLQVHKLTILSMMIALDWLTPrFRIEGMAPMSSVVNILAGIMMCPVYALAMATVTAnRMTTQGIPPLALTGAT 
FGALLAGLFYKYGRKFHYSALGEILGTGIIGSIVSYPVMVLFTGSAAKLSWFIYTPRFFGATLIGTAISFIAFRFLIKQEFFK 
5:) KVQGYFFSERIDZ 



MQEFTNPFPlGSSSLIHCrTNEISCEMLANGILALGCKPVMADDSREVLDFTKQSQALFINLGHLSAEKEKAIRMAASYAN 
QSSLPMVVDAVGVTTSSIRKSLVKDLLDYRPTVLKGNMSEIRSLVGLKHHGVGVDASAKDQETEDLLQVLKDWCQTYP 
60 GMSFLVTGPKDLVVSFCNQVAVLGNGCTELDWrTGTGDLVGALTAVFLSQGKTGFEASCLAVSYLNlAAEKIVVQGMG 
LEEFRYQVLNQLSLLRRDENWLDTIKGEVYEZ 



MNHKIAILSDVHGNATALEAVIADAKNQGASEYWLLGDIFLPGPGANDLVALLKDLPITASVRGNWDDRVLEALDGQ 
65 YGLEDPQEVQLLRMTQYLMERMDPATIVWLRSLPLLEKKEIDGLRFSISHNLPDKNYGGDLLVENDTEKFDQLLDAET 
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DVAVYGHVHKQLLRYGSQGQQUNPGSlGMPYFNWEALKNHRSQYAVIEVEDGELLNIQFRKVAyOYEAELELAKSKG 
LPFIEMYEELRRDDNYQGHNLELLASUEKHGYVEDVKNFFDFL2 



5 MNVNQIVRIIPTLKANNRKLNETFYIETLGMKALLEESAFL^LGIXJTGLEKLVLEEAPSMRTRKVEGRKKLARLr^ 

NPLEIEGILSKTDSIHRLYKGQNGYAFEIFSPEDDLIUHAEDDIASLVEVGEKPEFQTDLASISLSKFEISMELHLPTDIESF 
LESSEIGASLDRPAQGQDLTVD^^^VTWDLSMLKFLVNELDlASLRQKFEST£YFIPKSEKFFLGKDRNNVELWFEEVZ 



10 MKWTKIIKKIEEQIEAGIYPGASFAYFKDNQWTEFYLGQSDPEHGLQTEAGLVYDLASVSKVVGVGTVCTFLWEIGQLD 
IDRLVIDFLPESDYPDrriRQLLTHATDLDPFIPNRDLLTAPELKEAMFHLNRilSQPAFLYSDVHFLLLGFILERIFNQDLD 
VILKIXJVWKPWGMTETKFGPVELAVPTVRCVEAGIVHDPfCARLLGRHAGSAGLFSTlKDLQlFLEHYLADDFARDLNQ 
NFSPLDDKERSLAWNLEGDWLDHTGYTGTFIMWNRQKQEATIFLSNRTYEKDERAQWILDRNQVMNLIRKEEZ 



MMKKTYNHILVWGVIFYSICIVCFCFTPQEQSTVGVGTPGIQHLGRLVFLLTPFNSLWKLGEVSDIGQLCWIFLQNILNV 
FLFFPLIFQLLYLFPNLRKTKKVLLFSFLVSLGIECTQULDFFFDFNRVFEIDDLWT^^^LGCYLAWLLYKRLHKNKVR^ 

Z 

20 

MKIPLLTFARHKFVYVLLTLLFLALVYRDVLMTYFFFDIHAPDLAKFDGQAIKNDLLKSALDFRILQFNLGFYQSFIIPIII 
VLLGFQYIELKNKVLRLSIGREVSYQGLKRKLTLQVASIPCLIYLVTVLnAriTYFFGTFSPLGWIsrSLFSDGSGLQRLLDGE 
IKSYLFFTCVLLIGIFINAIYFLQIVDYVGNVTRSArTYLMFLWLGSMLLYSALPYYMVPMTSLMQASYGDVSLMKLFTP 
YILYIVPYMVLEKYEDNVZ 

25 

MFKVLQKVGKAFMLPlAlLPAAGLLLGlGGALSNPTTIATYPILDNSIFQSIFQVMSSAGEVVFSNLSLLtCVGLaGLAKR 
DKGTAALAGVTGYLVMTATIKALVKLFMAEGSAIDTGVIGALVVGIVAVYLHNRYNNIQLPSALGFFGCSRFVPIVTSF 
SSILIGFVFFVfWPPFQQLLVSTGGYtSQAGPIGTFLYGFLMRL5GAVGLHHnYPMFWYTELGGVETVAGQTVVGAQKIF 
30 FAQLADLAHSGLFTEGTRFFAGRFSTMMFGLPAACLAiMYHSVPKNRRKKYACLFFGVALTSFrrGITEPIEFMFLFVSPV 
LYWHAFLDGVSFFlADVLNISIGhrTFSGGVIDFTLFGILQGNAKTNWVLQIPFGUWSVLYYIIFRWFITQFNVLTPGRGE 
EVDSKElSESADSTSNTADYLKQDSLQnRALGGSNNIEDVDACVTRLRVAVKEVNQVDKALLKQIGAVDVLEVKGGIQ 
AIYGAKAILYKNSINEILGVDDZ 

35 MKFRKLACTVLAGAAVLGlJ>.ACGNSGGSKDAAKSGGDGAKTErrwWAFPVFTQEKTGDGVGTYEKSnEAFEi^ 

KVKLETIDFKSGPEKITTAIEAGTAPDVLFDAPGRIIQYGiCNGKLAELNDLFTDEFVKDVNNENIVQASKAGDICAYMYPI 
SSAPFYMAMNKKMLEDAGVANLVKEGWTTDDFEKVLKALKDKGYTPGSLFSSGQGGDQGTRAFISNLYSGSVTDEKV 
SKYTTDDPKFVKGLEKATSWIKDNLINNGSQFDGGADIQNFANGQTSYTILWAPAQNGIQAKLLEASICVEVVEVPFPSD 
EGKPALEYLVNGFAVFNNKDDKKVAASKKFIQFIADDKEWGPKDVVRTGAFPVRTSFGKLYEDKRMETISGWTQYYSP 

40 YYNTIDGFAEMRTLWFPMLQSVSNGDEKPADALKAFTEBCANETIfCiCAMKQZ 



MQSTEKKPLTAFTVrSTIILLLLTVLFIFPFYWILTGAFKSQPDTlVIPPQWFPKMPTMENFQQLMVQNPALQWMWNSVFI 
SLVTMFLVCATSSLAGYVLAKKRFYGQRILFAIFIAAMALPKQVVLVPLVRJVNFMGIHDTLWAVILPLIGWPFGVFLM 
45 KQFSENrPTELLESAKIDGCGEIRTFWSVAFPIVKPGFAALAIFTFI^^^WNDYFMQLVMLTSRNNLTISLGVATMQAEMA 
TNYGUMAGAALAAVPIVTVFLVFQKSFTQGITMGAVKGZ 



MKrMFKNFNNILLNRKIVLLLRIVLMMILINHLLSTAVQKQDAVIFFKREl.ISIFSYNDYS£ANLEIPKLLLNLSLFMVGW 
50 LSVILLESDLADHYHHLIRYQSSSFFDYTRfCRLVVlSKFFTQDLFVWFLGLLPLGIHFKTVALFFLLAQLMMLYLLLSYU 
ALISAGAGFSFFLYFLAFVGQEWMMDHIVTVYLVLLSLLVMLIVSRLEEKFKKGZ 



MGKGEMGKGVIGLEFDSEVLVNKAPTLQLANGKTATFLTQYDSICTLLFAVDKEDIGQEIIGIAKGSIESMHNLPVNLAG 
55 ARVPGGVNGSKAAVHEVPEFTGGVNGTEPAVHEIAEYKGSDSLVTLTTKKDYTYKAPLAQQALPETGNKESDLLASLG 
LTAFFLGLFTLGKKREQZ 



60 MKKTFFLLVLGLFCLLPLSVFAIDFKINSYQGDLYIHADNTAEFRQKIVYQFEEDFKGQIVGLGRAGKMPSGFDIDPHPKI 
QAAKNGAELADVTSEVTEEADGYTVRVYNPGQEGDIVEVDLVWNLKNLLFLYDDIAELNWQPLTDSSESIEKFEFHVR 
GDKGAEKLFFHTGKLFREGTIEKSNLDYTIRLDNLPAKRGVELHAYWPRTDFASARDQGLKGNRLEEFNKIEDSIVREK 
DQSKQL\T^VLPSILSISLLLSVCFYFIYRRKTTPSVKYAKNHRLYEPPMELEPMVLSEAVYSTSLEEVSPLVKGAGKFTF 
DQLIQATLLDVIDRGNVSUSEGDAVGLRLVKEDGLSSFEKDCLNLAFSGKKEETLSNLFADYKVSDSLYRRAKVSDEKR 

65 IQARGLQLKSSFEEVLNQMQEOVRJCRVSFWGLPDYYRPLTGGEKALQVGMGALTILPLFIGFGLFLYSLDVHGYLYLPL 
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PILGFLGLVLSVFYYWKLRLDNRDGVLNEAGAEVYYLWTSFENMLRHIARLDQAELESfVVWNRLLVYATLFGYADK 
VSHl.MKVHQIQVENPDINLYVAYGWHSTFYHSTAQ^^SHYASVA^^rASTYSVSSGSGSSGGGFSGGGGGCSIGAFZ 



5 MKKVRKIFQKAVAGLCCISQLTAFSSIVALAETPETSPAlGKVVIKETGEGGALI.GDAVFELKNNTDGrTVSQRTEAQT^ 
EAIFSNIKPGTYTLTEAQPPVGYKPSTKQWTVEVEKNGRTTVQGEQVENREEAL^DQYPQTGTYPDVQTPYQIIKVTC 
EKNGQHKALNPNPYERVIPEGTLSKRIYQVNNLDDNQYGIELTVSGKTVYEQKDKSVPLDVVILLDNSNSMSNIRNKNA 
RJt^ERAGEATJ^LrDKITSDSENRVALVTYASTIFDGTEFTVEKGVADKNGKRLNDSLFWNYDQTSrrTN^ 
TNDKNDIVELKKKVPTEAEDHDGNRLMYQFGATFTQKALMKADEILTQQARQNSQKVIFHrriXjVPTMSYPINFNH^^ 

10 FAPSYQNQLNAFFSKSPNKDGELLSDFITQATSGEHTIVRGDGQSYQMFTDFCTVYEKGAPAAFPVKPEKYSEMKAAGYA 
VIGDPl^^GGYrWL^^WR£SILAYPFNS^r^AKITNHGDPTR^VYYNG^fIAPDGYDVFTVGIGINGDPGTDEATATSFMQSISS 
KPENYTNVTDTTKILEQLNRYFHTIVTEKKSIENGTITDPMGELIDLQLGTTCRFDPAJ^YTLTANDGSRLENGQ^ 
QNDGGLLKNAKVLYDTTEKRIRVTGLYLGTDEKVTLTYWRLNDEFVSNKFYDTNGRTTLHPKEVEQNTVRDFPIPKI 
RDVRKYPEmSiCEKKLGDIEFIKVNKNDBCKPLRGAVFSLQKQHPDYPDIYGAIE>QNGTYQNVRTG£DGKLTFKNLSDG 

15 KYRLFENSEPAGYKPVQNKPIVAFQIVNGEVRDVTSIVPQDIPAGYEFTNDICHYrrNEPrPPKREYPRTGGIGMLPFYLIG 
CMMMGGVLLYTRKHPZ 



MKSINKFLTMLAALLLTASSLFSAATVFAAGTTTTSVTVHKLLATDGDMDKIANELETGNYAGNKVGVLPANAKEIAG 
20 VMFVVVThrrNNEIIDENGQTLGVNIDPQTFKLSGAMPATAMKn.T£A£6AKi=OTANlJ>AAKYKIY£IH^ 

TLTGSKAVPIEIELPLNDVVDAHVYPKNTEAKPKIDKDFKGKANPDTPRVDKDTPVNHQVGDVVEYEIVTiaPALANYA 
TANWSDRMTEGLAFNKGTVKVTVDDVALEAGDYALTEVATGFDLKLTDAGLAKVNDQf^AEKTVKITYSATLNDKAl 
VEVPESNDVTFNYGNNPDHGNTPKPNKPNENGDLTLTKTWVDATGAPIPAGAEATFDLVNAQTGfCVVQTVTLTTDKN 
TVTVNGLDKNTEYKFVERSIKGYSADYQEITTAGEIAVKNWKDENPKPLDPTEPKVVTYGKKFVKVNDKDNRLAGAEF 
25 VlANADNAGQYLARKADKVSQEEKQLVVrnCDALDRAVAAYNALTAQQQTQQEKEKVDKAQAAYNAAVlAANNAF 
EWVADKDNENVVKLVSDAQGRFErrGLLAGTYYLEETKQPAGYALLTSRQKFEVTATSYSATGQGIEYTAGSGICDDAT 
KVVNKKmPQTGGIGTIIFAVAGAAlMGIAVYAYVKNNKDEDQLAZ 



30 MTMQKMQKMISRIFFVMALCFSLVWGAHAVQAQEDHTLVLQLENYQEVVSQLPSRDGHRLQVWKLDDSYSYDDRV 
QrVRDLHSWDENKLSSFKKTSFEMTFLENQIEVSHIPNGLYYVRSIIQTDAVSYPAEFLFEMTDQTVEPLVIVAKKTDTM 
TTKVKLIKVDQDHNRLEGVGFKLVSVARDVSEKEVPLIGEYRYSSSGQVGRTLYTDFCNGEIFVTNLPLGNYRFKEVEPL 
AGYAVTTLDTDVQLVDHQLVrrrVVNQKLPRGNVDFMKVDGRTNTSLQGAMFKVMKEESGHYTPVLQNGKEVVVTS 
GKDGRFRVEGLEYGTYYLWELQAPTGYVQLTSPVSFTIGKDTRKELVTVVKNNKRPRIDVPDTGEETLVYLDACCHFV 

35 VW2 



MSHIYLSIFTSLLLMLGLVNVAQADEYLRIGMEAAYAPFNWTQDDDSNGAVKIDGTNQYANGYDVQIAKKLAKDLGKE 
PLVVKTKWEGLVPALTSGKIDMIlAGMSPTAERfCQElAFSSSYYTSEPVLLVKKDSAYASAKSLDDFNGAfCrrSQQGVYL 
40 YNLIAQIPGAKKETAMGDFAQMRQALEAGVIDAYVSERPEALTAEAANSKFKMIQVEPGFKTGEEDTAIAIGLRKNDNR 
ISQINASIETISKDDQVALMDRMIKEQPAEATTTEETSSSFFSQVAKILSENWQQLLRGAGrTLLISIVGTnGUrGLAIGVFR 
TAPI^E^OCVIYGLQKLVG^^^LNVYIEIFRGTPMIVQSMVIYYGTAQAFGINLDRTl^rFIVSI^^^GAYMTEIVRGGILAV 
DKGQFEAATALGMTHNQTMRKIVLPQVVRNILPATGNEFVINIKDTSVLNVISVVELYFSGNTVATQTYQYFQTFTIIAV 
lYFVLTFTVTRILRflERRMDMDTYTTGANQMQTEDLKZ 

45 



MTQAILEIKHLKKSYGQNEVLKDISLTVHKGEVISIIGSSGSGKSTFLRSINLLETPTDGQILYHGQNVLEKGYDLTQYREK 
LGMVFQSFNLFENLNVLE^^^IVAQTTVLKR£RTEAEKIAKENLEKVGMGERYWQAKPKQLSGGQKQRVAlARALSMN 
PDAILFDEPTSALDPEMVGEVLKIMQDLAQEGLTMIVVTHEMEFARDVSHRVIFMDKGVIAEEGKPEDLFTNPKEDRTK 
50 EFLQRYLKZ 



MKKYQLLFKISAVFSYLFFVFSLSQLTUVQNYWQFSSQIGNLFWIQNILSLLFIGVMIVVLViCTGHGYLFRIPRKKWLW 

YSlLTVLVLVFQlSFNVQTAKHVQSTAEGWAVLIGYSGTNFAELGrYlALFFLVPLMEELIYRGLLQHAFFKHSRFGLDLL 

LPSILFALPHFSSLPSLLDIFVFATVGIIFAGLTRYTICSIYPSYAVHVINNIVATFPFLLTFLHRVLGZ 



MNKKQWLGLGLVAVAAVGLAACGNRSSRNAASSSDVKTKAAIVTDTGGVDDKSFNQSAWEGLQAWGKEHNLSKDN 
GFTYFQSTSEADYANNLQQAAGSYNLrFGVGFALNNAVKDAAKEHTDLNYVUDDVIiCDQKNVASVTFADNESGYLA 
GVAAAKTTKTKQVGFVGGIESEVISRFEAGFICAGVASVDPSIKVQVDYAGSFGDAAKGKTIAAAQYAAGADIVYQVAC 
GTGAGVFAEAKSLNESRPENEKVWVIGVDRDQEAEGICYTSICDGKESNFVLVSTLKQVGTTVKDISNKAERGEFPGGQV 
IVYSLKDFCGVDLAVTNLSEEGKKAVEDAKAKILDGSVKVPEKZ 
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MSKXLQQISVPLrSVFLGILLGArV^MWIFGYDAlWGYEELFYTAFGSLRGIGEIFRAMGPLVUGLGFAVASIUGFFW 
PGQALAGWlL$GWFAL5HPDMPIU>LMILATIVIALUGGrVCAlFGrLRAYLGTSEVIVTlMMNYrVLVVGNAnH^ 
FMQSTDSTIRVGANATYQTPWUVELTGNSRMNIGIFFAllAVAVIWFMLKKTrLGFEIRAVGLNPHASEYAGISAK^ 
SMIISGALAGLGGAVEGLGTFQKVYVQGSSLAIGFNGMAVSLLAANSPIGILFAAFLFGVLQVGAPGMNAAQVPSELVSI 
5 VTASIIFFVSVHYLIERFVKPKICQVKGGKZ 



MGVKKiCLKLTSIXGLSLLIOTACATNGVTSDrrAESADFWSKLVYFFAEIIRFLSFDrSIGVGIILFTVURTVLLPVFQV 
MVASRKMQEAQPRIKALREQYPGRDMESRTKLEQEMRKVFKEMGVRQSDSLWPIUQMPVILALFQALSRVDFLKTGH 
10 FLWINLGSVDTTLVLPILAAVFTFLSTWl^NKALSERNGATTAMMYGIPVLIFIFAVYAPGGVALYWTVSNAYQ^ 
FLNNPFKIUEREAVVQAQKDLENRKRKAKKKAQKTKZ 



MVIDPFAINELDYYLVSHFHSDHIDPYTAAAILNNPKLEHVKFIGPYHCGRIWEGWGVPKERJIVVKPGDTIELKDMKIH 
15 AVESFDRTCLVTLPVNGADETGGELAGLAVTDEEMAQKAVNYIFETPGCTIYHGADSHFSNYFAKHGFCDFKIDVALNN 
YGENPVGIQDKMTSIDLLRMAENLRTKVHPVHYDIWSNFMASTNEILELWKMRKDRLQYDFHPFIWEVGGKYTYPQD 
QHLVEYHHPRGFDDCFEQDSNIQFKALL2 



20 MFLSGWI^SFANTYmDLLGVLFPI)SPFLNAFESAIAAPLVEEPLKLI^i:VFVi.A^-lPVRKLKSLFLl.GIASCLGFC^ 

GYIRTDLPEGFDFTISRILERIISGIASHWTFSGLAWGVYLLYRAYKGQKVGKKQGUFLGLALGTHFLFNSPFVELETEL 
PLAIPVVTAIALYGFYHAYCFVEKHNELMT2 



25 MKVEPRCDVLSRMSHFFIRIUMELQELVERSWAIRQAYHELEVKHHDSKWTVEEDLLALSNDIGNFQRLVMTKQGRY 
YDETPYTLEQJCLSENIWWLLELSQRLDIDILTEMENFLSDKEKQLNVRTWK2 



MLDWKQFFLAYLRSRSRLFIYLLSLAFLVLLFQFLFASLGIYFLYFFFLCCFVTILFFTWDILVETQVYRQELLYGEREAK 
30 SPLEIALAEKLEAREMELYQQRSKAERKLTDLLDYYTLWVHQIKTPIAASQLLVAEVVDRQLKQQLEQEIFKIDSYTNLV 
LQYLRLESFHDDLVLKQVQIEDLVKEIIRKYALFFIQKGLNVNLHDLDICEIVTDKKWLLVVIEQIISNSLKYTKEGGLEIY 
MDDQELCIKDTGIGIKNSDVLRVFERGFSGYNGRtTQQSSGLGLYLSKKISEELGHQIRIESEVGKGTTVRIQFAQVNLVL 
E2 

35 

MELNTHNAEILLSAANKSHYPQDELPEIALAGRSNVGKSSFINTMLNRKNLARTSGKPGKTQLLNFFNIDDKMRFVDVP 
GYGYARVSKKEREKWGCMIEEYLTTRENLRAVVSLVDLRHDPSADDVQMYEFLKYYEIPVIIVATKADKIPRGKWNKH 
ESAIKKKXNFDPSDDFILFSSVSKAGMDEAWDAILEKLZ 

40 

MTKKQLHLVrVTGMSGAGKTVAIQSFEDLGYFTlDNMPPALLPKFLQLVEIKEDNPKLALWDMRSRSFFSEIQAVLDE 
ENQDGLDFKILFLDAADFCELVARYKETRRSHPLAADGRILDGIKLERELLAPLKNMSQNVVDTTELTPRELRKTLAEQF 
SDQEQAQSFRJEVMSFGFKYGIPIDADLVFDVRFLPNPYYLPELRNQTGVDEPVYDYVMNHPESEDFYQHLLAUEPILP 
SYQKEGKSVLTIAMGCTGGQHRSVAFAKRLAQDLSKNWSVNEGHRDKDRRKETVNRS2 

45 



MRKPKrrVIGGGTGSPVILKSLREKDVElAAlVTVADDGGSSGELRKNMQQLTPPGDLRNVLVAMSDMPKFYEKVFQYR 

FSEDAGAFAGHPLGNLIIAGLSEMQGSTYNAMQLLSKFFHTTGKIYPSSDHPLTLHAVFQDGTEVAGESHIVDHRGUDN 

VYVTNALNDDTPLASRRVVQTILESDMrVLGPGSLFTSILPNIVlKEIGRALLETKAEIAYVCNIMTQRGETEHFTDSDHV 

EVLHRHLGRPFIDTVLVNIEKVPQEYMNSNRFDEYLVQVEHDFVGLCKQVSRVISSNFLRLENGGAFKDGDLIVDELMR 

I 

IQVKKZ 



55 MKNLIKLLIIRLIVNLADSVFYIVALWHVSNNYSSSMFLGrFIAVNYLPDLLUFFGPVIDRVNPQKILIISILVQLAVAVIFL 
LLLNQISFWVlMSLVFISVMASSISYVIEDVLfPQVVEYDICIVFANSLFSISYKVLDSIFNSFASFLQVAVGFILLVKIDrGlFL 
LALFILLLLKFRTSNANIENFSFKYYKREVLQGTKFILNNKLLFKTSISLTLINFFYSFQTVVVPIFSIRYFDGPIFYGIFLTIA 
GLGGILGNMLAPIVIKYLKSNQtVGVFLFLNGSSWLVAlVIKDYTLSLILFFVCFMSKGVFNIIFNSLYQQIPPHQLLGRVN 
TTlDSIISFGMPlGSLVAGTUDLNIELVHAISlPYFLFSYlFYTDNGLKEFSrYZ 



65 



MMSNKNKEILIFAILYTVLFMFTCVKLI^LMPSAIANYLVYVVLALYGSFLFKDRLIQQWKEIRKTKRKFFFGVLTGW 
LFLILMTVVFEFVSEMLKQFVGLDGQGLNQSNIQSTFQEQPLLIAVFACVIGPLVEELFFRQVLLHYLQERLSGLLSIILV 
GLVFALTHMHSLALSEWIGAVGYLGCGLAFSIIYVKEKENIYYPLLVHMLSNSLSLIILAISIVKZ 
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LKICPtmFKNVSKVFEDSNnT:VUaDINFELEEGKFYTU.GASGSGKSTlLNnAGLLDATT^ 
TVFQSYALH>HMWFENVAi?LiaJtKIDKKEIEQRVAEVLKMVQLEGYEKRSIRKLSGGQRQRVAI^^ 
EPLSALDLKLRTDMQYELRELQQRLCrrFVI^HIXJEEALAMSDWrFVMNDGEIVQSGTPVDIYDEPINHFVATFIGESN 
5 ILPGTMIEDYLVEFNGKRFEAVDGGMKPNEPVEVVIRPEDLRITLPEEGKLQVKVDTQLFRGVHYEIIAYDELGNEWMI 
HSTRKAIVGEEIGLDFEPEDIHIMRLNETEEEFDARIEEYVEIEEQEAGLINAIEEERDEENKLZ 



MKSMRILFLLALIQISLSSCFLWKECILSFKQSTAmGSMVFVSGICAGVNYLYTRKQEVHSVLASKKSVKLFfSM 
10 LLGAVLVLSDNL^K^m-QQELVDFLLPSFFFLFGLDLLIFLPLKKYVRDFLAMLDRKKTVLVTILATIXFL^ 
LIYIGLGLFFAAYLVPNSVKKEVSFYGHIFRDtVLVIVTLIFFZ 



MVKKnGMVUVLLSVTVVGVGVFAYTIYQQGTETLAKTYKKIGEETKVIEATEPLTILLMGVDTGNVERTETWVGRSDS 
15 MILMTVNPKTKKTTMMSLERDILTIOESGNGQAHEAKLNSAYADGGAELAIETIQKMMNIHIDRYVMVNMRGLQK^ 

DA VGGirVNNILGFPIS IS DQEEFNTISIGVGEQHlGGEEALVYARMRYQDPEGDYGRQKRQREVlQfCVMEKALSLNSIGH 
YQEILKAL^DNMQTNIDLSAKSIPNLLGYKDSFKTIETQQLQGEGEILQGVSYQrVSRAHMLEMQNLLRRSLGQEEVTQL 
ETNAVLFEDLFGRAPVGDEDN2 

20 

MKKQAYVIIALTSFLFVFFFSHSLLEILDFDWSIFLHDVEKTEKFVFLLLVFSMSMTCLLALFWRGIEELSLRKMQANLK 
RLLAGQEVVQVADPDLDASFKSLSGKLNLLTEALQKAENQSLAQEEEIIEKERKRIARDLHDTVSQELFAAHMILSGISQ 
QALKLDR£KMQTQLQS\TAILETAQKDLRVLLLHLRPVELEQKSLIEGIQILLKEL£DKSDLRVSLKQNMTKLPKKIEEHI 
FRILQELISNTLRHAQASCLDVYLYQTDVELQLKVVDNGIGFQLGSLDDLSYGLRNIKERVEDMAGTVQLLTAPKQGLA 
25 VDIRIPLLDKEZ 



WrVSIISQGFVWAILGLGIFMTFRILNFPDMTTEGSFPLGGAVAVTLrTKGVNPFLATLVAVGAGCLAGMAAGLLYTKGK 
IPTLLSGILVNfTSCHSIMLLIMGRANLGLLGTKQIQDVLPFDSDLNQLLTGLIFVSIVlALMLFFLDTKLGQAYIATGDNP 
30 DMARSFGIHTGRMELMGLVLSNGVIALAGALIAQQEGYADVSRGIGVIVVGLASLIIGEVIFKSLSLAERLVTP/VGSIAY 
QFLVWAVIALGFNTSYLRLYSALILAVCLMIPTFKQTILKGAKLSKZ 



MKKMKVWSTVI^TGVALTTLAACSGGSNSTTASSSEEKADKSQELVIYSNSVSNGRGDWLTAKAKEAGFNIKMVDIAG 
35 AQLADRVIAEKNNAVADMVFGIGAVDSNKlRDQKLLVQYKPKWLDiaDQSLSDKDNYYNPVIVQPLVLIGAPDViCEMP 
KDWTELGSKYKGKYSISGLQGGTGRAILASILVRYLDDKGELGVSEKGWEVAKEYLKNAYTLQKGESSIVKMLDKEDPI 
QYGMMWGSGALVGQKEQNVVFKVMTPEIGVPFVTEQTMVLSTSICKQALAKEFIDWFGQSEIQVEYSKNFGSIPANKD 
ALKDLPEDTKKFVDQVKPQNIDWEAVGKHLDEWVEKAELEYVQ2 

40 

MIKFDNIQIKYGDFVAlDNLNLDIHEGEFFTFLGPSCCGKSrrLRALVGFLDPSSGSIEVNGTDVTHLEPEKRGIGIVFQSY 
ALFPTMTVFDNIAFGLKVKKVAPDVIKAKVSAVAAKIKISDQQLQRNVSELSGGQQQRVALARALVLEPKILCLDEPLS 
NLDAKLRVDLRKELKRLQKELGnTLYVTHDQEEALTLSDRIAVFNNGYIEQVGTPVEiyHNSQTEFVCDFIGDINVLTD 
ETVHEVLLKNTSVFLEDKKGYIRLEKVRFNRETEQDFILKGTIIDVEFSGVTIHYTIKVSESQILNVTSIDSQAAIRSVGESV 
45 ELFITPSDVLQF2 



MRHKLNLKDWLIRLGLrWFLVTniYPNFDLVVNVFVKGGEFSLDAVHRVLKSQRALQSrMNSFKLAFSLirTVNVVGIL 
CVLFTEYFDIKGAKILKLGYMTSLIYGGVVLATGYKFVYGPYGLrTKFLQWIPSLDPNWFIGYGAVLFIMTFSGTANHT 
50 LFLThn^IRSVDYHTIEAARNMGAKPFrVFRKVVLPTLIPTLFALTIMVFLSGLSAVAAPMIVGGKEFQTINPMIITFAGMG 
NSRDLAALLAIILGIATTILLTIMNKIEKGGNYISISKTKAPLKKQKIASKPWNIIAHIVAYGLFTVFMLPUFIVLYSFTDPV 
AIQTGNLTLSNFTLENYRLFFSNSAAFSPFLVSFIYSUAATTATILAVVFARVVRXKKSRFDFLFEtGALLPWLLPSTLLA 
VSLLFTFNQPQFUVLNQILVGSLVILUAYIVVKIPFSYRMVRAILFSVDDEMEDAARSMGASPFYTMMKVIIPFILPVVLS 

vulnfnslltdfdlsvflyhplaqplgitirsagdetatsnaqalvfvytivlmiisgtvlyftqrpgrkvrkz 
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Tabit 3 

ID201 - 4X06.4 

5 atgataaaaaatcctaaattattaaccaagtcttttttaagaagttttgcaattctaggtggtc 
tcatataggtatttatttgacctttcctttttattatattcaacrggagggggaaaagrrt 
tgtttacggagtarrtaaagactaagacatctgatgaaattccaagcttactccagtcttattcaaa^ 
atatctgctcaccttaaaagagatattgtagataagcggctccctcttgtgcatgacttggatatt;^ 
gctatcaaattatatcgtgatgttagatatgtctgttagtacagcagatggtaaacaggtaaccgtgcaatttgct 
10 acggggtggatgtctacaaagaagcaaagaatattttgcttttgtatctcccat^ 
ttttcctttgttttttcttarrtttatactaaacgcttgctc^ 

aatgcaagatttggatgacaatattcgttttgatgaaagtaggaaagatgaagttggtgaagttggaaaac^ 

atggtatgtatgagcacttgrtgaaggttatttatgagttggaaagtcgtaatgagcaaattgtaaaact 

caaaaggtttcctttgtccgcggagcatcacatgagttgaaaacccctttagccagtcttag^ 

15 tatgcagcataatattggagattacaaagatcatccaaaatatattgcaaagagtataaataagattgaccagat^ 
gccacttattagaagaagtactggagtcttctaaattccaagagtggacagagtgtcgtgagaccttgactg^ 
ccagttttagtagatattttatcacgttatcaagaattagctcattcaataggtgttacaattgaaaatcaattgac 
agatgctaccagggtcgtcatgagtcttagggcattggataaggttttgacaaacctgattagtaatgcaattaaat 
attcagataaaaatgggcgtgtaatcatatccgagcaagatggctatctctctatcaaaaatacatgtgcgcctcta 

20 agtgaccaagaactagaacatttatttgatatattctatcattctcaaatcgtgacagataaggatgaaagttccgg 

TTTGGGTCTTTACATTGTGAATAATATTTTAGAAAGCTATCAAATGGATTATAGTTTTCTCCCTTATGA^ 
TGGAATTTAAGATTAGCTTGTAG 



25 MIKNPKIXTKSrLRSFAILGGVGLVIHIAIYLTFPFYyiQLEGEKFNESARVrrEYLKTKTSDEIPSLLQSySKSLT 
I S AHLKRD I VDKRLPLVHDLD I KDGKLSNYI VMLDMS VSTADGKQVTVQ FVHGVDVYKEAKNI LLL YLP YTFLVTI A 
FSFVFSYFYTKIlLLNPI^FYISEVTSKMODLDDNIRFDESRiaDEVGEVGKQINGMYEHLLKViyELESRNEQ 
QKVS FVRGASHE LKTPLAS LRI I LENMQHNI GD YKDHP KY I AKS INKI DQMSHLLEEVLESS KFQEWTECRETLTVK 
PVLVDILSRYQELAHSIGVTIENQLTDATRVVMSLiyU-DKVLTNLISNAIKYSDKNGRVII^ 

30 SDQELEHLFDIFYKSQIVTDKDESSGLGLYIVNNILESYQMDYSFLPYEHGMEFKISLZ 

ID202 - 4106.9 

ATGGATAAAATTATTAAAACTATATCAGAAAGCGGAGCCTTTCGTGCTTTTGTCCTTGATAGCACTGAAACCGTCCG 
35 CACTGCTCAAGAAAAACATCAAACCCAAGCTAGCTCAACTGTAGCGCTTGGTCGAACTCriTATCGCTA 

TCGCAGCCA^TGAAAAAGGAAATACCAAACTTACAGTTAAGGTGTTGGGATCTAGCTCTCTAGGTGCTACT 
GTCGCTGATACCAAGGGGAACGTCAAAGGCTATGTTCAAAATCCTGGTGrrGACATCAA^^ 
AGTCCTAGTCGGACCTTTTGTTGGAAATGGTCAATTCCTCGTTATCACAGACTACGGTACTGGAAATCCT 
CTATAACTCCCCTCATCTCTGGAGAAATCGGTGAAGACCTTGCCTTTTACCTTACTGAAAGCCAACAAACGCCTTC^ 
40 GCGGTCGGCCTCAATGTCCTTTTGGACGAGGAAGACAAGGTCAAGGTTGCAGGTGGTTTCCTAGTTCAAGTCr^ 

AGGAGCCAAGAAAGAAGAGATTGCTCGCTTTGAAAAACGCATCCAAGAAATGCCAGCTATCTCrrACTCTTCTCGAAA 
GCGACGACCATATCGAAGCCCrcCTCAAGGCTATCTACGGGGACGAAGCCTACAAGCGTCrTTCTGAAGAAGAAATC 
CGTTTCCAATGTGACTGTAGCCATGAACGCTTTATGAACGCTCTTGCCAGCCTTCCAAGCTCAGACTTACAGGAA^ 
GAAAGAGGAAGACCACGGGGCAGAAATCACTTGTCAATTCTGCCAAACTACTTACAACTTTGATGAAAAGGACCT 
45 AGGAACTCATTCGTGACAAATCTTAA 



^^KXIKTISESGAFRAFVI^STETVRTA0EKHQT0ASSTVALGRTLIASQI1J;ANEKGNTKLTVKVLGSSSLG^ 
VADTKGNVKG YVQNPGVD I KKTATGEVLVGP FVGNGQ FLV I TD YGTGNP YNS ITPLI SGE I GEDIAFYLTESQQTPS 
50 AVGI^LLDEEDKVKVAGGFLVQVLPGAICKEEIARFEKRIQEMPAISTLLESDDHIEALLKAIYGDEAYKRLSESEI 
RFQCDCSHERFMNALASLPSSDLOEMKEEDHGAErTCQFCQTTYNFDEKDLEELIRDKSZ 



ID203 - 4115 

55 

ATGAAATCAATAACTAAAAAGATTAAAGCAACTCTTGCAGGAGTAGCTGCCTTGTTTGCAGTATTTGCTCCA 
TGTATCTGCTCAAGAATCATCAACTTACACTGTTAAAGAAGGTGATACACTTTCAGAAATCGCTGAA^ 
CAACAGTTGAAAAATTGGCAGAAAACAACCACATTGATAACATTCATTTGATTTATGTTGATCAAGAGTTGGCT 
GATGGCCCrrGTAGCGCCTGTTGCAACACCAGCGCCAGCTACTTATGCGGCACCAGCCGCTCAAGATGAAACTGTTTC 
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AGCTCCAGTAGCAGAAACTCCAGTAGTAAGTCyvAACAGTTGTTTCAACTGTAAGCGGATCTGAAGCAG^ 
AATGGATCGCTCAAAAAGAATCAGGTGGTAGTATACAGCTACPJUVTGGACGTTATATCGGACGTTACCAATTAA 



5 MKSITKKIKATLAGVAALFAVFAPSFVSAOESSTYTVKEGDTLSEIAETHNTTTOKIJ^m 

DGPVAPVATPAPATYAAPAAQDETVSAPVAETPWSETWSTVSGSEAEAKEWIAQKESGGSIQLQMDVISDVTN2 

ID204 - 4117.1 

10 atgaatttaggagaattttggtacaataaaataaataagaacagaggaagaaggttaatgaagaaact 

ttt tt tagctctgct all - ri ' tc^ ' r agctagtccagagggtgcaatggctagtgatggtacttggcaaggaa^ 
atctgaaagaagatggcagtcaagcagcaaatgagtgggtttttgatactcattatcaatcttggttctatat;^^ 
gcagatgctaactatgctgaaaatgaatggctaaagcaaggtgacgactatttttacctcaaatct 
ggccaaatcagaatgggtagaagacaagggagccttttattatcttgaccaagatggaaagatgaaaagaaatgcct 

15 gggtaggaacttcctatgttggtgcaacaggtgccaaagtaatagaagactgggtctatgattctcaatacgatgct 
tggttttatatcaaagcagatggacagcacgcagagaaagaatggctccaaattaaagggaaggact^ 
atccggtggttatctactgacaagtcagtggattaatcaagcttatgtgaatgctagtggtgccaaagtacagcaag 
gttggctttttgacaaacaataccaatcttggttttacatcaaagaaaatggaaactatgct 
ttcgagaatggtcactattattatctaaaatccggtggctacatggcagccaatgaatggatttgggataaggaatc 

20 ttggttttatctcaaatttgatgggaaaatggctgaaaaagaatgggtctacgarrctcatagtcaagc^ 

acttcaaatccggtggttacatgacagccaatgaatggatttgggataaggaatcttggttttatctcaaatct 
gggaaaatagctgaaaaagaatgggtctacgattctcatagtcaagcttggtactacttcaaatccggtot 
gacagccaatgaatggatttgggataaggaatcttggttttacctcaaatctgatgggaaaatagctga 
gggtctacgattctca.tagtcaagcttggtactacttcaaatcrggtggctacatggcgaaaa^ 

25 ggttatcagcttggaagcgatggtaaatggcttggaggaaaaactacaaatgaaaatgctgcttactatc^ 

gcctgttacagccaatgtttatgattcagatggtgaaaagctttcctatatatcgcaaggtagtgtcgtatggctag 
ataaggatagaaaaagtgatgacaagcgcttggctattactatttctggtttgtcaggctatatgaaaacagaagat 
ttacaagcgctagatgctagtaaggactttatcccttattatgagagtgatggccaccgtttttatca^ 
tcagaatgctagtatcccagtagcttctcatctttctgatatggaagtaggcaagaaatattattcggcaga 

30 tgcattttgatggttttaagcttgagaatcccrrtccttttcaaagatt^ 

gaattggataaggtatttagtttgctaaacattaacaatagccttttggagaacaagggcgctacttttaaggaagc 
cgaagaacattaccatatcaatgctctttatctccttgcccatagtgccctagaaagtaactggggaag^ 
ttgccaaagataagaataatttctttggca7taca.gcctatgatacgaccccttacctttctgctaagacat^ 
gatgtggataagggaattttaggtgcaaccaagtggattaaggaaaattatatcgataggggaagaactttcc^ 

3 5 aaacaaggcrrtctggtatgaatgtggaatatgcttcagacccttattggggcgaaaaaattgctagtgtgatgatga 
aaatcaatgagaagctaggtggcaaagattag 



MNLGSFWYNKINKNRGRRLMKKVRFIFIJ^LLFFU^PEGAHASDGTWOGKQyL^ 

40 ADANYAENEWLKQGDDYFYLKSGGYMAKSEWEDKGAFYYLDQDGKMKRNAWGTSYVGATGAKVIEDWVYDSQYDA 
WFYIKADGOHAEKEWLQIKGKDYYFKSGGYLLTSQWINQAYVNASGAKVQQGWLFDKQYOSWFYIKENGNYADK^ 
FENGHYYYLKSGGY^WVNEWIWDKESWFYLKTDGKMAEKEWVYDSHSOAWYYFKSGGYMTANEWIWDKESW^ 
GKIAEKEVATfDSHSQAWYYFKSGGYMTANEWIWDKESWFYLiCSDGKIAEKEWVYDSHSOAWYYFKSGGYMAKNET^ 
GYQLGSDGKWLGGKTTNENAAYYQWPVTANVYDSDGEKLSYISOGSVVWLDKDRKSDDKRIAITISG^ 

45 LOALDASKDFIPYYESDGHRFYHYVAQNASIPVASHLSDMEVGKKYYSADGLHFDGFKLENPFLFKDLTEATNYSAE 
EIJDKVFSLLNINNSLLENKGATFKEAEEHYHINALYLLAHSALESNWGRSKIAKDKNNFFGITAYDCT 
DVDKGILGATKWIKENYIDRGRTFLGNKASGMNVEYASDPYWGEKIASVMMKINEKLGGKD2 

XD2Q5 - 4118.1 

50 ATGAAAAAATTAGGTACATTACTCGTTCTCTTTCT7TCTGCAATCATTCTTGTAGCATGTGCTAGCGGAAAAAAAGA 
TACAACTTCTGGTCAAAAACTAAAAGTTGTTGCTACAAACTCAATCATCGCTGATATTACTAAAAATATTGCTGG^ 
ACAAAATTGACCTTCATAGTATCGTTCCGATTGGGCAAGACCCACACGAATACGAACCACTTCCTGAAGACGTTAAG 
AAAACTTCTGAGGCTAATTTGATrrrCTATAACGGTATCAACCTTGAAACAGGTGGCAATGCTTGGT^ 
GGTAGAAAATGCCAAGAAfiACTGAAAACAAAGACTACTTCGCAGTCAGCGACGGCGTTGATGTTATCTACCTTGAAG 

5 5 GTCAAAATGAAAAAGGAAAAGAAGACCCACACGCTTGGCTTAACCrrTGAAAACGGTATTATTTTTGCT AA^^ 

GCCAAACAATTGAGCGCCAAAGACCCTAACAATAAAGAATTCTATGAAAAAAATCTCAAAGAATATACTGATAAGTT 
AGACAAACTTGATAAAGAAAGTAAGGATAAATTTAATAAGATCCCTGCTGAAAAGAAACTCATTGTAACCAGCGAAG 
GAGCATTCAAATACTTCTCrrAAAGCCTATGGTGTCCCAAGTGCTTACATCTGGGAAATCAATACTGAAGAAG^ 
ACTCCTGAACAAATCAAGACCTTGGTTGAAAAACrrCGCCAAACAAAAGTTCCATCACTCT 

60 GGATGACCGTCCAATGAAAACTGTTTCTCAAGACACAAACATCCCAATCTACGCTCAAATCTTTACTGACTCTATCG 
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CAGAACAAGGTAAAGAAGGCGACAGCTACTACAGCATGATGAAATACAACCTTGACAAGATTGCTG^^ 
AAATAA 



5 MKKLGTLLVLFLSAI ILVACASGKKDTTSGOKLKWATNS 1 1 ADITKNI AGDKIDLHS I VP IGQDPHEYEPLPEDVK 

iCTSEANLIFYNGINLErrGGNAWrrKI.VENAKKTENKDYFAVSDGVDVIYLEGQNEKGKEDPHAWI^LENGIIFAK:^ 
AKQLSAKDPNNKEFYEKNLKEYTDKXJJKIJ^KESKDKFNKIPAEKKLIVTSEGAF^ 

TPEQ I KTLVEKI.RQTKVPSLFVESSVDDRPMKTVSQDTNI P I YAQ I FTDS I AEQGKEGDSYYSMMKYNIXiK^ 
K2 

10 

ID206 ■ 4119.1 

atggaatggtataaaaaaatcggacttcttgcaactacaggtttagctttgtttgggcrcg 

tggtaaatcttgcggatggcacagtgaccatcgagtatttcaaccagaaaaaagaaatgaccaaaaccttggaaga^ 

tcactcgtgattttgagaaggaaaaccctaagatcaaggtcaaagtcgtcaatgtaccaaatgctggtgaagt^ 
15 aagacacgcgttctcgcaggagatgtgcctgatgtggtcaatatttacccacagtccatcgaactgcaagaatgggc 

aaaagcaggtgtttttgaagatttgagcaacaaagactacctgaaacgcgtgaaaaatggctacgctgaaa;^ 

ctgtaaacgaaaaagtttacaacgttccttttacagctaatgcttatggaatttactac^ 

gaactgggcttgaaggttcctgaaacctgggatgaatttgaacagttagtcaaagatatcgttc 

accatttggaattgcaggtgcagatgcttggacactcaatggttacaatcaattagcctct 
20 gaggaaaagaagcaaatcaataccttcgttattctcaaccaaatgccattaaattgtcggatccgattatgaaagat 

gatatcaaggtcatggacatccttcgcatcaatggatctaagcaaaagaactgggaaggtgctggctataccgatg 

TATCGGAGCCTTCGCACGTGGGGATGTCCTCATGACACCAAATGGGTCTTGGGCGATCACAGCGATTAATGAACAAA 
AACCGAACTTTAAGATTGGGACCTTCATGATTCCAGGAAAAGAAAAAGGACAAAGCTTAACCG 
TTGGCATGGTCTATCTCAGCCACCACCAAACATCCAAAAGAAGCCAATGCCTCT 
25 AGTCATGCAAAAATACTACGATGTGGACGGATCTCCAACAGCGATCGAAGGGGTCAAACAAGCAGGAGAAGATTCAC 
CGCTTGCTGGTATGACCGAATATGCCTTTACGGATCGTCACTTGGTCTGGTTGCAACAATACT 
GACTTCCATACCTTGACCATGAACTATGTCTTGACCGGTGATAAACAAGGCATGGTCAATGATTTGAATGCCTTCTT 
TAACCCGATGAAAGCGGATGTGGATTAG 

30 

MEWYKKIGLtATTGIJU.FGLGACSl^GKSADGTVTIEYFNQKKE^r^KTLEEITRDFEKENP 

KTRVLAGDVPDWNI YPQS I ELQEWAKAGVFEDLSNKD YLKRVKNGYAE KYAVNEKVYNVP FTANAYG I YYNKDKFE 
ELGLKVP ETWDE FEQLVKD I VAKGQT P FG I AGADAWTI^G YNQLAFATATGGGKEANQ YliRYSQ PNAI KLSD P IMKD 
D I KVMD I LRINGS KQKNWEGAGYTDVI GAFARGDVIJ^TPNGSWAI TAINEQKPNFKI GTFMI PGKEKGQSLTVGAGD 
35 LAWSISATTKHPKSANAFVEYMTRPEVMQKYYDVTX3SPTAIEGVK0AGEDSPIAGMTEYAFTDRHLW 
DFHTLTMNYVLTGDKQGMVNDLNAFFNPMKADVDZ 

ID207 - 4123 . 1 

ATGAAGAAAATCAAACCGCATGGACCGTTACCAAGTCAGACTCAGCTAGCTTATCrrGGGAGATGAACTAGCA^ 

40 TATCCACTTCGGTCCTAATACCTTTTATGACCAAGAATGGGGGACTGGACAGGAGGATCCTGAGCGCTTTAACCCGA 
GTCAGTTGGATGCGCGTGAGTGGGTTCGTGTGCTCAAGGAAACGGGCTTCAAAAAGTTGATTTTGGTGGTCAAGCAC 
CACGATGGCTTTGTCCTTTATCCGACAGC7CACACAGATTATTCGGTTAAGGTCAGTCCTTGGAGGAGAGGAAAGGG 
CGACTTGCTCCTTGAAGTATCCCAAGCTGCCACAGAGTTTGATATGGATATGGGGGTCTACCTGTCACCGTGGGATG 
CCCATAGTCCCCTCTATCATGTGGACCGAGAAGCGGACTACAATGCCTATTATCTGGCTCAGTTGAAGGAAATCTTA 

45 TCAAATCCTAACTATGGGAATGCTGGTAAGTTCGCTGAGGTTTGGATGGATGGTGCCAGAGGAGAGGGCGCGCAAAA 
GGTTAATTATGAATTTGAAAAATGGTTTGAAACCATTCGTGACCTGCAGGGCGATTGCTTGATTTTTTCAAC^^ 
GCACCAGTATCCGCTGGATTGGCAATGAACGAGGGTATGCAGGTGATCCACTGTGGCAAAAGGTGAATCCTGATAAA 
CTAGGAACAGAAGCAGAGCTGAACTATCTTCAGCACGGGGATCCCTCGGGCACGATTTTTTCAATCGGAGAGGCAGA 
TGTTTCCATCCGTCCAGGCTGGTTCTACCATGAGGATCAGGATCCTAAGTCTCTCGAGGAGTTGGTCGAAATCTACT 

50 TTCACTCAGTAGGGCGAGGAACTCCACTCTTGCTTAATATTCCGCCGAATCAAGCTGGGCTCTTTGATGCAAAGGAT 
ATTGAACGACTTTATGAATTTGCGACCTATCGCAATGAGCTCTATAAAGAAGATTTGGCTCTGGGAGCTGAGGTATC 
TGGTCCAGCTCTTTCCGCAGACTTTGCTTGTCGCCATTTGACAGACGGCCTTGAGACCAGCTCTTGGGCAAGCGATG 
CAGACTTGCCCATCCAGTTAGAACTCGACTTAGGTTCTCCTAAAACTTTTGATGTAATTGAGTTAAG^^ 
AAGCTAGGGCAACGAATCGCTGCTTTTCATGTGCAAGTAGAGGTGGATGGTGTCTGGCAGGAGTTTGGTTCGGGTCA 

55 TACTGTTGGTTACAAACGTCTCTTACGAGGAGCAGTTGTTGAGGCACAGAAGATACGTGTAGTCATTACAGAATCAC ^ 

AGGCTTTGCCTTTGTTGACCAAGATTTCCCTTTATAAAACTCCTGGATTATCAAAAAAAGAAGTTGT^ 
GCATTTGCAGAAAAAAGCCTAGCTGTGGCIAAAGGGAGAAAATGCCTATTTTACAGTTAAGCGCAGAGAATG 

TCCTTTAGAAGCTAAGATTTCGATTCAACCGGGGACAGGTGTCCATGGTGTCGCCTATCAGGATGAGATTCAAG^ t 
TTGCGTTTCAAACTGGTGAGACTGAAAAAAGTCTGACGCTACCAACCTTGTATTTCG CAGGAG ATAAAACCTTGGAT 
60 TTCTATCTGAACCTAACGGTGGATGGTCAGCTTGTGGATCAACTTCAAGTCCAAGTTTCATAA 
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MKKIKPHGPLPSQTQIAYLGDEIJyiFIHFGPOTFyDOEWGTGQEDPERFNPSOI^AREWVRVL^ 
HDGFVL YPTAHTD YS VKVS PWRRGKGDLLLE VSQAATE FDMDMGVYLS P WDAHS P L YHVDRE AD YNAYY LAQL 
SNPNYGNAGKFAEVWMDGARGEGAQKVNYEFEKWFETIRDI^DCLIFSTEGTSIRWIGNERGYAGDPLWQK^ 

5 LGTEAELl^LQHGDPSGTI FS IGEADVS IRPGWFYHEDQDPKSLEELVEI YFHSVGRGTPLLLKI PPNQAGLFDAKD 

lERLYEFATYRNELYlCEDLALGAEVSGPAI^SADFACRHLTDGLETSSWASDADLPIQLELDLGSPKTFDVIELREDL 
KI^RIAAFHVQVEVDGVWQEFGSGHTVGYKRLLRGAWEAOKIRWITESQALPLLTKISLYKTPGI^KK^^ 
AFAEKSIAVAKGENAYFTVKRRECSGPLEAKISIQPGTGVHGVAYQDEIQVI^FQTGETEKSLTLPTLYFAGDKTI^ 
FYLNLTVDGQLVDQLQVQVS2 

10 

TD2Q8 - 4125.12 

ATGCTTGAAAGACrGAAAAGAATACATTATATGTTTTGGATCAGTTTAATTTTTATGATTTTCCCCATOT 

AGTGACTGGGTGGCTTTCTGCCTGGCATTTATTGATTGATATTCTATTTGTAGTGGCATATTTGGGTGTTTTAACAA 

CTAAGAGCCAGCGCCTATCTTGGCTATATTGGGGCCTCATGCTGACTTATGTAGTTGGGAATACTGCCTTTGT TGCT 

1 5 GTTAATTATATCTGGTTTTTCTTTTTCCTATCCAATCrrCTTAAGTTATCATTTCAG^^ 

ACATGTCTGGACTTTTCTTCTTGCTCAAGTCCTTGTTGTGGGGCAACTGTTGATriTTCAGAGAAT 
TTCTATTCTATCTACTTGTAATTCTTACTTTTGTCGATTTAATGACTTTTGGATTGGTTCGGATTCGTATT^^ 
GATTTGAAAGAAGCTCA.GGTCAAGCAAAATGCTCAGATAAATCTATTGCTTGCTGAAAATGA^ 
TCAGGATTTGCATGATAGTCTGGGACATACCTTTGCTATGCTGAGTGTCAAGACAGATTTAGCCrrGCAGTTATT^ 

20 AGATGGAGGCTTATCCLACAGGTGGAAAAGGAATTAAAAGAAATTCACCAGATCAGCAAGGATCCATGA 



MLERLKRIHYMFWISLIFMIFPILSVVTGWLSAWHLLIDILFWAYLGVLTTKSORIiSWLYWGLMLTYWGNTAFV^ 
VNYIWFFFFLSNLI^YHFSTOSLKSLHWTFLIAQVLWGQLLIFQRIEVEFLFYLLVILTFVDLMTFGLVRIRIVE 
25 DLKEAQVKQNAQINLLLAENERSRIGQDL^DSLGHTFAMLSVKTTDIJUJDLFQMEAYPQVEKE 

ID209 * 4126.3 

ATGAATGATAAGTTAAAAATCTTCTTGTTGCTAGGAGTATTTTTTCTAGCCATAACCGGTTTCTATGTTCTATTGAT 
ACGAAATGCAGGGCAGACAGATGCCTCGCAAATTGAAAAGGCGGCAGTTAGCCAAGGAGGAAAAGCAGTGAAAAAAA 

30 CAGAAATTAGTAAAGACGCAGACTTGCACGAAATTTATCTAGCTGGAGGTTGTTTCTGGGGAGTGGAGGAATATTTC 
TCACGTGTTCCCGGGGTGACGGATGCCGTTTCAGGCTATGCAAATGGTAGAGGAGAAACAACCAAGTACGAATTGAT 
TAACCAAACAGGTCATGCAGAAACCGTCCATGTCACCTATGATGCCAAGCAAATTTCTCTCAAGGAAATCCTGCTTC 
ACTATTTCCGCATTATCAATCCAACCAGCAAAAATAAACAAGGAAATGATGTGGGGACCCAGTACCGTACT 
TATTACACAGATGACAAGGATTTGGAAGTGATTAACCAAGTCTTTGATGAGGTGGCTAAGAAATACGATCAACCT 

3 5 AGCAGTTGAAAAGGAAAACTTGAAGAATTTTGTGGTGGCTGAGGATTACCATCAAG ACTATCTCAA 

ATGGCTACTGCCATATCAATGTTAATCAGGCGGCCTATCCTGTCATTGATGCCAGCAAATATCCAAAACCAAGTGAT 
GAGGAATTGAAAAAGACCCTGTCACCTGAGGAGTATGCAGTTACCCAGGAAAATCAAACAGAACGAGCTTTCTCA^ 
CCGTTACTGGGATAAATTTGAATCCGGTATCTATGTGGATATAGCAACTGGGGAACCTCTCTTTTCATCAAAAGACA 
AATTTGAGTCTGGTTGTGGCTGGCCTAGTTTTACCCAACCCATCAGTCCAGATGTTGTCACCTACAAGGAAGATAAG 

40 TCCTACAATATGACGCGTATGGAAGTGCGGAGCCGAGTAGGAGATTCTCACCTTGGGCATGTCTTTACGGATGGTCC 
ACAGGACAAGGGCGGCTTACGTTACTGTATCAATAGCCTCTCTATCCGCTTTATTCCCAAAGACCAAATGGAAGAAA 
AAGgcTACGCTTATTTACTAGATTATGTTGATTAA 

45 I^KLKIFLLLGVFFIJ^ITGFYVLLIRNAGQTDASQIEKAAVSQGGKAVKKTEISKDADI^EIYLAG 

SRVPGVTDAVSGYANGRGETTKYELINQTGHAETVHVTYDAKQISLKEILLHYFRIINPTSKNKOGNDVGTQYRTGV 
YYTDDKDLEVINQVFDEVAKKYDQPIJVVEKENLKNFWAEDYHQDYLKKNPNGYCHI>n;NQAAYPVrDASIC^^ 
EELKKTLSPEEYAVTQENQTERAFS^rRYWDKFESGIrvTiIATGEPLFSSKDKFESGCGWPSFTOPISPDVVTY 
SYNMTRMEVRSRVGDSHLGHVFTDGPQDKGGLRYCINSLSIRFIPKDQMEEKGYAYLLDYVDZ 

50 

ID210 > 4127.1 

ATGAAAAAGAAATGGATGTATTATGCTGCTTGTTCTTCTAATGAATCTGCCGATGACAGTTCATCTGATAAAGGAGA 

CGGCGGTTCGCTAGTCGTTTATTCACCAAACTCAGAGGGCTTAATTGGAGCAACTATTCCTGCCTTrGAAGAA 

ATGGTATCAAAGTAGAACTGATTCAAGCTGGTACTGGAGAACTTTTCAAAAAACTAGAGTCAGAAAA^ 

55 GTAGCTGATGTTATCTTTGGTGGTTCTTATACACAATATACTACCCACGGAGAACTCTTTGAAAACTATACTTC^^ 
AGAAAATGATAATGTTATCAAAGAATATCAAAACACAACTGGCTACTCTACTCCTTATACACTAGATGGTAGTGTT^ 
TAATCGTCAACCCTGATTTAACTAAAGGCATGAACATCGAAGGATATAACGATCTTTTCAAACCTGAACTAA^ 
AAAATCGCAACTGCTGACCCAGCAAACTCTTCTAGCGCCTTTGCTCAATTAACAAATATGCTA^^ 
TTACAAAGATGATAAGGCTTGGTCTTATGTAAAAGATCTTTTCACACTTATTGATGGTAAAATCGGT 

60 CTAGTGTCTATAAAGTAGTCGCTGATGGAGAAATGGCTGTTGGTCTCTCTTATGAAGATCCAGCAGTTAAACTCTTA 



AATGACGGAGCTAACATTAAGGTAGTCTATCCAAAAGAAGGAACCGTCTTCCTACCTGCTAGTGCTGCTATCGTTAA 

AAAATCTAAAAATATGGAAAATGCCAAGAAATTTATCGATTTTATTATCTCTCAAGAAG 

CAACCACTACTAACCGTCCTGTTCGTAAAAATGCTAAAACAAGCGAAAACATGAAACCJVATTGACAA^ 

CTCACTGAAGATTATGATTATGTCATCAAGAATAAATCAGATATCGTTAAGAAATACAACGAAGTCTTTAC^ 

CCAATCTAAACAGTAA 



MKKKWMYYAACSSNESADDSSSDKGDGGSLWYSPNSEGLIGATIPAFEEKYGIKVELIQAGTGELFKKLESEKEVP 
VADVI FGGS YTQYTTHGELFEOTTSKENDWI KZYQ^^rTGYSTPyTLDGSVLI WPDLTKGMNIEGVOT 
KIATADPANSSSAFAOLTNMUDAOGGYKDDKAWSYVKDLFTLIDGKIGSSSSSVYKWADGEMAVGLSY 
iTOGANIKVVTPKEGTVFI.PASAAIVKKSIC^mENAKKFIDFIISOEVQDTLGTTTTO 
LTED YD YV I KNKSD I VKKYNE VFTD I QS KQZ 

XD21I - 4127.2 

atgagtgagatcaaaattattaacgccaaaaaaatctaccacgatgtccctgttattgagaatttgaacattac^ 

tccaaaaggaagtctctttacccttct7ggagcttcaggatgtgggaaaacgacccttcttcgtatgattgca<^^ 

tcaacagtatcgaaggtggagaattttacttcgatgatacaaaaatcaataatatggaacccagcaaacgcaatatc 

gggatggttttccaaaactacgctattttcccacatttgactgtccgXgacaacgttgc^^ 

gaaggtrccaaaagaagaattgattcaacagaccaacaagtatcttgaacrrcatgcaaattgc 

gaaagcccgataaactcagtggtggacaacaacaacgtgtcaccttggcatgcgccttagcggttaatccaagtgct 

ctccrrcatggacgagccacttagtaatctggaggccaaacttcgcttggatatgcgtcaagccatccgagaaatcca 

ACACGAAGTGGGAATTACAACTGTTTA7GTAACCCACGACCAA.GAAGAAGCCATGGCTATTTCAGACCA^ 

ttatgaaagatggggtgatccaacaaatcggccgaccaaaagaactctatcataaaccagctaatgagtttgtggca 

acctttXtcggacgcacaaatattatccctgccaatcttgaaaaacggagcgacggcgcttatatcgtctt^ 

tggctatgcccttcgaatgccagctcttgatcaggttgaggagcaagctattcatgtaagcattcgtcccgaagagt 

ttatcaaagatgaatctggagatattgaaggaactattagagatagcgtctatcrrggactaaa 

attgagacaggttttgcctcaaaaattcaagttagtgaagaatcaacttttgaagaagatctac^^ 

tattcgtctacgaatcaatacggaaaaattaaacatcttitctgcagatggttcccaaaacct 

accatggaacgtaa 

MSEIKIINAKKIYKDVPVIENLNITIPKGSLFTLLGASGCGKTTLLRMIAGFNSXEGGEFYFDDTKIinJM 

G^fVFQNYAIFPHLTVRD^TVAFGI^OKKVPKEELIQQTNKYLElJ^QIAOYADRKPDKI.SGGQQQRVTM 

liLMDE P LSNLSAKLRLDMROAI RE I QHEVG ITTVYVTHDQEE AMA I SDQ I AVMKDGVI QQI GRPKELYHKPANE FVA 

TFrGRTNIIPANLEKRSDGAYIVFSDGYALRMPALDQVEEQAIHVSIRPEEFIKTiESGDIEGTIRDSVYLGLNTDYF 

IETGFASKIQVSEESTFEEDL.QKGNRIRLRXNTQKLNIFSADGS0NLIKGVNHGT2 

ID212 - 4136.1 

ATGAAGAAAAAATTATTGGCAGGTGCCATCACACTATTATCAGTAGCAACTTTAGCAGCTTGTTCGAAAGGGTCAGA 

AGGTGCA.GACCTTATCAGCATGAAAGGGGATGTCATTACAGAACATCAATTTTATGAGCAAGTGAAAAGCAACCCT 

CAGCCCAACAAGTCTTGTTAAATATGACCATCCAAAAAGTTTTTGAAAAACAATATGGCTCAGAGCTTGAT^ 

GAGGTTGATGATACTATTGCCGAAGAAAAAAAACAATATGGCGAAAACTACCAACGTGTCTTGTCACAAGCAGGTA^ 

GACTCTTGAAACACGTAAAGCTCAAAT7CGTACAAGTAA?^TTAGTTGAGTTGGCAGTTAAGAAGGTAGCAGA^ 

AATTGACAGATGAAGCCrrATAAGAAAGCCTTTGATGAGTACACTCCAGATGTAACGGCTCAAATCATCCGTCTTAAT 

AATGAAGATAAGGCCAAAGAAGTTCTCGAAAAAGCCAAGGCAGAAGGTGCTGATTTTGCTCAATTAGCCA^ 

TTCAACTGATGAAAAAACAAAAGAAAATGGTGGAGAAATTACCTTTGATTCTGCTTCAACAGAAGTACCT 

TCAAAAAAGCCGCTTTCGCTTTAGATGTGGATGGTGTTTCTGATGTGATTACAGCAACTGGCACACAAGCCTAC^^ 

agccaatattacattgtaaaactcactaagaaaacagaaaaatcatctaatattgatgactacaaagaaaaatt;^ 

AACTGTTATCTTGACTCAAAAACAAAATGATTCAACATTTGTTCAAAGCACTATCGGAAAAGAATTGCAAGC^^ 

atatcaaggttaaggaccaagccttccaaaatatctttacccaatatatcggtggtggagattcaagctcaagcagt 
agtacatcaaacgaatag 

mkkxli^gaitllsvatlaacskgsegadlismkgdvitehqfyeqvksnpsaqqvllmxiokvfekqygselddk 

EVDDTIAEEKKQYGENYORVl^OAGMTLETRKAQIRTSKLVEIJ^VKJCVAEAELTDEAYKKAFDEYTPDV^^ 
NEDKAKEVLEKAKAxIGADFAQLJUaDNSTDEKTKENGGEITFDSASTEVPEQVKKAAFALDVD 

SOYYIVKLTKKTEK^SNIDDYKEKLKTVILTQKQNDSTFVOSIIGKELQAANrKVKDQAFOMIFTQYIGGGDSSSSS 
STSNEZ 
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113213 - 4137.3 

atgaaaaaaaatattaaacaatatgtaaccttaggtactgtagtggtattatcagcatttgtt 
agctcaggagactgaaacttctgaagtatcaacaccaaagttggtgcaacctgttgcacc^ 
aagtacaacctacatcggataactcttcggaagttactgtacaacctcgaacagttgaaactactgct 
5 tcitctacagcggaagaaactcctgtcrragaaaaaaataatgttactttaacagggggcggagaaaat 
agagttaaaggataaatttactagcggtgactttactgtagtgattaagtacaatcagtcaagtgag;^ 
aagctctgtttggaatatctaattccaaacccggtcaacaaaatagttatgtagatgtgttccttagagac^ 

GAGTTGGGGATGGAAGCGCGTGATACITCTTCCAATAAAAATAACCTAGTATCCAGACCTGCTT^ 
GTACAAACAAGAGGCTGTGACTAACACTGTTGCAGTAGTAGCAGArrCAGTCAAAAAAACATACT 

10 ATGGTACAAAAGTAGTAGAAAAGAAAGTGGATAATTTCCTAAACATCAAGGATATTAAAGGTATTGACT 
CTTGGGGGAGTGAAACGTGCAGGAAAAACGGCGTTTGGTTTTAACGGAACACTAGA 
TGCATTGGATGAAGAAACTGTTAAAAAGATGACAACAAACGCTGrrACTGGACATTTAATCT 
CAACAGGTTCTAACTATTTCCGTATTCCAGTTCTGTATACTTTTAGCAATGGTCGGGTATTTTC^ 
CGTTACGGTGGAACTCATGAi" i i'Cn'GAATAAAATTAATATTGCTACAAGTTATAGTGATGATAATGGTAAGACATG 

15 GACTAAACCAAAATTAACACTGGCATTCGATGATriTGCGCCj\GTACCATTAGAATGGCC^ 

GTGACTTACAAATCAGCGGTGGTGCAACCrrATATTGACTCTGTTATTGTTGAAAAAAAGAACAAACAAGTACT 
TTTGCTGATGTGATGCCTGCTGGAGTAAGTTTTAGAGAAGCAACTAGAAAAGATTCAGGTTATAAACAAATTGATGG 
TAATTATTACCTTAAATTAAGGAAACy^GGTGATACTGATTACAATTATACTArTCGTGAGAATGGTJ^^ 
ACGATCGTACCAACAGACCAACTGAATTTTCAGTAGATAAAAATTTCGGTATTAAACAAAATGGTAATTATTTGACG 

20 GTAGAGCGG 

MKJQ^IKOYVTLGTVWLSAFVANSVAAQETETSEVSTPKLVQPVAPTTPISEVQPTSDKSSEVTVOPRTVETTO 
SSTAEETPVLEKNNVTLTGGGENVTKELKDKFTSGDFTWIKYNQSSEKGLQALFGISNSKPGQQNSYVDVFLiy^ 
ELGMEARDTSSNKNNLVSRPASWGKYKQEAVTNTVAWACSVKKTYSLYANGTICWEKKTO 
25 LGGVKRAGKTAFGFNGTLENI KFFNSALDEETVKKMTTNAVTGHLI YTANDTTGSNYFRI PVLYTPSNGRVPSSIDA 

RYGGTHDFLNKINIATSYSDDNGKTWTKPKLTIAFDDFAPVPLEWPREVGGRDLQISGGATYIDSVIV^KKNKQVLM 

FADVMPAGVSFREATRroSGYKQIDGNYYLKLRKQGDTDYm^TIRENGTVYDDRTNRPTEFSVDKNFGI 
VER 

30 ID214 - 4185 

ATGAAAAAATTTAGCCTATTACTAGCTATCCTACCATTTTTGGTTGCCTGTGAGAATCAAGCTACACCCAAAGAGAC 
TAGCGCTCAAAAGACAATCGTCCTTGCTACAGCTGGCGACGTGCCACCATTTGACTACGAAGACAAGGGCAATCTGA 
CAGGCTTTGATATCGAAGTTTTAAAGGCAGTAGATGAAAAACTCAGCGACTACGAGATTCAATTCCi^ 
TGGGAGAGCATCTTCCCAGGACTTGATTCTGGTCACTATCAGGCTGCGGCCAATAACTTGAGTTACACAAAAGAGCG 

35 TGCTGAAAAATACCTTTACTCGCTTCCAATTTCCAACAATCCCCrrCGTCCTTGTCAGCAACAAGAAAAATCCT^ 
CTTCTCTTGACCAGATCGCTGGTAAAACAACACAAGAGGATACCGGAACTTCTAACGCTCAATTCATCAATAACT 
AATCAGAAACACACTGATAATCCCGCTACAATTAATrrTTCTGGTGAGGATATTGGTAAACGAATCCTAGACCTTGC 
TAACGGAGAGTTTGATTTCCTAGTTTTTGACAAGGTATCCGTTCAAAAGATTATCAAGGACCGTGGT^ 
CAGTCGTTGATTTACCTTCTGCAGATAGCCCCAGCAATTATATCATTTTCTCAAGCGACCAAAAAGAGTTTAAAGAG 

40 CAATTTGATAAAGCGCTCAAAGAACTCTATCAAGACGGAACCCTTGAAAAACTCAGCAATACCTATCTAGGTGGTTC 
TTACCTCCCAGATCAATCTCAGTTACAATAA 



MKKFSLLIAILPFLVACENQATPKETSAQKTIVLATAGDVPPFDYEDKGNLTGFDIEVLKAVDEKLSDYEIQFQRTA 
45 W£SIFPGU:)SGHYQAAANNLSYTKERAEKYLYSLPrSNNPLVLVSNKKNPLTSLDQIAGKTTQEDTGTSNAOFIN^^ 
N0KHTDNPATINFSGEDIGKRIIZ)IJ^GEFDFI.VFDICVSVOKIIKDRGLDLSVVDLPSADSPSNYIIFSSDOKEFKE 
OFDKALKELYQDGTLEKLSNTYLGGSYLPDQSQLOZ 

ID215 - 4211.1 

50 ATGAAAAAAAATAGTTTATATATCATATCCTCACTCrrTTTTTGCTTGTGTCTTATTTGTCTATGCTACGGCGACGAA 
TTTTCAAAACAGTACCAGTGCTAGGCAGGTAAAAACGGAAACCTATACTAATACAGTAACAAATGTCCCTATTGACA 
TACGCTATAATAGTGATAAGTATTTTATTAGCGGTTTTGCTTCAGAAGTATCAGTGGTCTTGACTGGTGCAAATCGC 
CTATCGCTAGCTAGTGAAATGCAAGAAAGTACACGTAAATTCAAGGTTACTGCTGACCTAACAGATGCCGGTGTTG 
AACGATTGAAGTTCCTTTGAGCATTGAAGATTTACCCAATGGGCTGACCGCTGTGGCGACTCCGCAAAAAA 

55 T^CAAGATTGGTAAGAAGGCTCAGAAGGATAAGGTAAAGATTGTACCAGAGATTGACCCTAGTCAAATTGATAGTCGG 

GTACAAA7TGAAAATGTCATGGTGTCAGATAAAGAAGTGTCTATTACGAGTGACCAAGAGACATTGGATAGAATTGA 

TAAGATTATCGCTGTTTTGCCAACTAGCGAACGTATAACAGGTAATTACAGTGGTTCAGTACCTTTGCAGGCAATCG 

ACCGCAATGGTGTTGTCTTACCGGCAGTTATCACTCCGTTTGATACAATAATGAAGGTGACTACAAAACCAGTAGCA 

CCAAGTTCAAGCACATCAJ^TTCAAGTACAAGCAGTTCATCGGAGACATCTTCGTCAACGAAAGCAACTAGT^ 
OU AACGAATTAA 
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MKKNSLYI I SSLFFACVLrVYATATNFONSTS ARQVKTETYTNTVTNVP I D I RyNSDKY FI SGFASEVS WLTGANR 
I^IJ^EMOESTRKFKVTADLTDAGVGTIEVPLSIEDLPNGLTAVATPQKITVKIGKKAQKDKVKIVPEIDPSQI 
VQIENVKVSDKEVS ITSDQETLJDRIDKI I AVLPTSERITGNYSGSVPLQAI DRNGWLPAVITP FDTIMKVTTKPVA 
5 PSSSTSNSSTSSSSETSSSTKATSSKTN2 

T0516 - 4127.3 

ATGTTGATTGGCGAAGGGTATCGGA C TTTCCCTGTCCTGATTTATACCCAATTTATTAGCGAGGTTGGAG^ 
TGCTTTTGCAATTATGGCGATTATCATTGCCTTGGCAATTTTCCT^ 
1 0 TCAGCATGAATCTGCTCCATCCAATTGAGCCTAAAAAAACTACAAAAGGAAAAATGGCTGC^ 
TACGGAATTATCTTTATCTCTGTTTTACCTCAAATCTACTTAATTTATAC^ 

ATCTGTTAAAGGTTATTCTCCAAACAGTTACAAGGTAGCTTTCCATCGTATGGGATCTGCTATTTTC^ 
GTATCCCTTTGATTGCCTTAGTTCTAGTTGTTCTATTTGCGACATTTATCrCCTACC^^ 
TTGTTTACAAACTTAATTGACAGCCTCAGTATGGTACCTTATATTGTACCAGGAACCG^ 
1 5 TTCTTCCTTCAATACTGGTCTATTTGGAAGTGGATTTCTTATGATTACAGGGACTGC^^ 

TATCTGCCAGAAGATTACCTTATACTATTCGCTCATCTGTTGCTAGCTTACAACAAATAGC^ 

GCTGCTGAAAGCTTAGGAAGTAGTCGTCTCAATACCTTTGCTAAGATTACAACTCCAATGATGCT 

TTCTGGAGCCATCTTATCTTGA 

20 

MLIG£GyRTFPVI.IYTQFISEVGGNSAFAIMAIIIAIj;.IFLIOKHrANRYSFSMNLIJIPIEPKKTTKGK^^ 
YGIIFISVLPQIyLIYTSFLKTSG^^VSVKGYSPNSYKVAFHR^1GSAIFNTIRIPLIALVLVVLFATFISyLAVRKRN 
LFTNLIDSLSMVPYIVPGTVLGIAFISSFNTGLFGSGFLMITGTAFILIMSLSARRLPYTIRSSVASLQOIAPSIEE 
AAES LGSSRLNTFAKITT PMMLSG 1 1 SGAI LS 2 

25 
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Table 4 
ID301 

ATGAATAAGAAAAAAATGATTTTAACAAGTCTAGCCAGCGTCGCTATCrrAGGGGCTGGT^^ 
5 TACrrTTGTAAGAGCAGAAGAATCTCCACAAGTTGTCGAAAAATCrrCATTAGAGAAGAAATATGAGG^ 

CAAAAGCTGATACTGCCAAGAAAGATTACGAAACGGCTAAAAAGAAAGCAGAAGACGCTCAGAAAAAGTAT^ 

GATCAGAAGAGAACTGAGGAGAAAGCTCGAAAAGAAGCAGAAGCATCTCAAAAATTGAATGATGTGC^ 

TCAAAATGCATATAAAGAGTACCGAGAAGTTCAAAATCAACGTAGTAAATATAAATCTGACGCTGAATATCA^ 

aattaacagaggtcgactctaaaatagagaaggctaggaaagagcaacaggacttgcaaaataaatct 
10 agagcagttgtagttcctgaaccaaatgcgttggctgagactaagaaaaaagcagaagaagcta;^ 

AGTAGCTAAGAGAAAATATGATTATGCAACTCTAAAGGTAGCACTAGCGAAGAAAGAAGTAGAGGCTAAG 
AAATTGAAAAACTTCAATATGAAATTTCTACTTTGGAACAAGAAGTTGCTACT^ 
AAAAAACTTCTTGCTGGTGCGGATCCTGATGATGGCACAGAAGTTATAGAAGCTAAATTAAAAAP^ 
GCTAAACGCTAAACAAGCTGAGTTAGCAAAAAAACAAACAGAACTTGAAAAACTTC^ 
1 5 GTAAGACTCAGGATGAATT AGATAAAGAAGC AG AAG AAGCTG AGTTGGATAAAAAAGCTGATGAACTTCAAAATAAA 

GTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTACTTGGAGGGGCTGATCCTGAAGATC 
TCTTCAAAATAAATTAGCTGCTAAAAAAGCTGAGTTAGCAAAAAAACAAACAGAACCT 

TTGATCCTGAAGGTAAGACTCAGGATGAATTAGATAAAGAAGCAGAAGAAGCTGAGTTGGATAAAAAAGCTGATX^ 
CTTCAAAATAAAGTTGCTGATTTAGAAAAAGAAATTAGTAACCTTGAAATATTAC^ 

20 TGATACTGCTGCTCTTCAAAATAAATTAGCTACTAAAAAAGCTGAATTGGAAAAAACTCAAAA^ 

CTCTTAATGAGTTAGGCCCTGATGGAGATGAAGAAGAAACTCCAGCGCCGGCTCCTCAACCAGAGCAACCA 
GCACCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTCCTGCACCAAAACCAGAGC^ 
TCCAGCTCCAAAACCAGAGCAACCAGCTCCAGCTCCAAAACCAGAGCAACCAGCTAAGCCGGAGA 
AGCCTACTCAACCAGAAAAACCAGCCACTCCAAAAACAGGCTGGAAACAAGAAAACGGTATGTGGTAT^ 

25 ACTGATGGrrCAATGGCAATAGGTTGGCTCCAAAACAACGGTTCATGGTACTACCTAAACGCTAACGGCGCT 

AACAGGTTGGGTGAAAGATGGAGATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGW 
AAGTATCAGATAAATGGTACTATGTCAACAGCAATGGCGCTATGGCGACAGGCTGGCTCCAAT^ 
TACTACCTCAACGCTAATGGTGATATGGCGACAGGATGGCTCCAATACAACGGTTCATGGTATTACCTC^ 
TGGTGATATGGCGACAGGATGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCrrAACGGTGCTATGGCT 

30 GTTGGGCTAAAGTCAACGGTTCATGGTACTACCTAAACGCTAACGGTTCAATGGCAACAGGTTGGGTGAAA^ 

GATACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAGATAAATGGTACTA 

TGTCAATGGCTTAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATAAAGTCAATGCCAATGGTGAATG^ 

AA 

35 ^^kkkmiltsiiasvailgagfvtsqptfvraeespqvveksslekkyeeakakadt 

dqk31teskarkeaeasqklndvalwqnaykeyrev0nqrskyksdaey0kkltevdskiekarke 
ravwp epnaiaetkkkaeeakaee kvakrkyd yatlkvaiajoceveakele i e klq ye i stleqevataqhqvdnl 
kkl1agadpddgtevieaklkkgeaelnakqaelakk0telek1.u:^sldpegktcdeldkem 
vadlek2:isnleillggadpsddtaal0nkij^aicka£lakkqteleklix}sld 

lonkvadlek^isnleillggadseddtaalqnkxatkkaelektqkeldaalnelgpdgdeeetpapapqpeqpap 
apkpsopapapkpeopapapkpeqpapapkpeqpapapkpeqpakpekpaeeptqpekpatpktgwkoengmwyfyn 
tdgsmaigwlonngswyylnangamatgvatkdgdtwyyleasgamkasqwfkvsdkwyyvnsng;^ 

YYU^ANGD^lATGWLQYNGSWYYLNA^IGD^lA.TGWAKVNGSWYYI^ANGAMATGWAKVNGSWYYLN 
DTV^LEASGAMKASQWFKVSDKWYYVNGI^AIJlWTTVI:>GYKVNA^XGE 

45 

ID3 02 

ATGT7TGCATCAAAAAGCGAAAGAAAAGTACATTATTCAATTCGTAAATTTAGTGTTGGAGTAGCTAGTGTAGTTGT 

TGCCAGTCTTGTTATGGGAAGTGTGGTTCATGCGACAGAGAACGAGGGAGCTACCCAAGTACCCACTTCCT 

'2G<5CAAATGAAAGTCAGGCAGAACAAGGAGAACAACCTAAAAAACTCGATTCAGAACGAGATAAGGC^ 

DV GTCGAGGAATATGTAAAAAAAATAGTGGGTGAGAGCTATGCAAAATCAACTAAAAAGCGACATACAATTACT 
TCTAGTTAACGAGTTGAACAACATTAAGAACGAGTATTTGAATAAAATAGTTGAATCAACCTCAGAAAGCC^ 
AGATACTGATGATGGAGAGTCGATCAAAAGTAGATGAAGCTGTGTCTAAGTTTGAAAAGGACTCATCTTCTTCGTCA 
AGTTCAGACTCTTCCACTAAACCGGAAGCTTCAGATACAGCGAAGCCAAACAAGCCGACAGAACCAGGAG 
AGCAGAAGCTAAGAAGAAGGTTGAAGAAGCTGAGAAAAAAGCCAAGGATCAAAAAGAAGAAGATCGTCCT 

:>:> <^CCATTACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATGTGG^^ 

GTAAAAGTGAAAGCTAACGAACCTCGAGACGAGCAAAAAATTAAGCAAGCAGAAGCGGAAGTTGAGAGTAAAO^ 
TGAGGCTACAAGGTTAAAAAAAATCAAGACAGATCGTGAAGAAGCAGAAGAAGAAGCTAAACGAAGAGCAGATGCTA 
AAGAGCAAGGTAAACCAAAGGGGCGGGCAAAACGAGGAGTTCCTGGAGAGCTAGCAACACCTGATAAAAAAGAAAAT 
GATGCGAAGTCTTCAGArrCTAGCGTAGGTGAAGAAACTCTTCCAAGCCCATCCCTGAAACCAGAAAA^ 

OU AGAAGCTGAGAAGAAGGTTGAAGAAGCTAAGAAAAAAGCCGAGGATCAAAAAGAAGAAGATCGCCGTAACTACCCAA 



40 
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CCAATACTTACAAAACGCTTGAACTTGAAATTGCTGAGTCCGATCTGGAAGTTAAAAAAGCG^^ 

AAAGAGGAAGCTAAGGAACCTCGAAACGAGGAAAAAGTTAAGCAAGCAAAAGCGGAAGTTGAGAGTAAAAAAG^ 

GGCTACAAGGTTAGAAAAAATCAAGACAGATCGTAAAAAAGCAGAAGAAGAAGCTAAACGAAAAGCAGCAGAAG^ 

ATAAAGTTAAAGAAAAACCAGCTGAACAACCACAACCAGCGCCGGCTCCAAAAGCAGAAAAACC^^ 

AAACCAGAGAATCCAGCTGAACAACCAAAAGCAGAAAAACCAGCTGATCAACAAGCT 

ATCAGAAGAAGAATATAATCGCTTGACTCAACAGCAACCGCCAAAAACTGAAAAACCAGCACAA 

AAACAGGCTGGAAACAAGAAAACGGTATGTGGTACTTCTACAATACTGATGGTTCAATGGCGA^^ 

AACAATGGCTCATGGTACTACCTCAACAGCAATGGCGCTATGGCGACAGGATGGCTCCAAA^ 

CTATCTAAACGCTAATGGTTCAATGGCAACAGGATGGCTCCAAAACAATGGTTCATGGTACT^^ 

GTTCAATGGCGACAGGATGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTTC^ 

TGGCTCCAATACAATGGCTCATGGTACTACCTAAACGCTAATGGTGATATGGCGACAGGTTGGGTGAAAGAT^^ 

TACCTGGTACTATCTTGAAGCATCAGGTGCTATGAAAGCAAGCCAATGGTTCAAAGTATCAG 

TCAATGGCTCAGGTGCCCTTGCAGTCAACACAACTGTAGATGGCTATGGAGTCAATGCC^ 
TAA 

MFASKSERKVHYSIRKFSVGVASVWASLVMGSWHATENEGATQVPTSSNRANESQAEQGEQPra^ 

VEEYVKKIVGESYAKSTKKRKTITVALVNEU^NIKNEYI^KIVESTSESQLQIIJWESRSK^ 

SSDSSTKPEASDTAKPNKPTEPGEKVAEAKKKVEEAEKKAKDQK^EDRRNYPTITYKTLELEIAESDVEV^^ 

VKVKANEPRDEOKIKQAEAEVESKQAEATRLKKIKTDREEAEEEAKRRADAKEOGKPKGRAKRGVPGEIATPDK^ 

DAKSSDSSVCEETLPSPSLKPEKK^AEAEKK^EEAKKKAEDOKEEDRRNYPTNTYKTLELEIAESDV^ 

KSEAKEPRNEEKVKOAKAEVESK^ATRLEKIKTDRKKAEEEAKRKAAEEDK^KEKP^^ 

KPEKPAEQPKAEKPADMAEEDYARRSEEEYNRLTQQQPPKTEKPAQPSTPKTGWKQENGmYFYNTDGSMATGWLQ 

NNGSWYYI^SNGAMATGWLQNNGSWYYI^ANGSMATGWLONNGSWYYI^ANGSMATGWLOYNGSWYYI^ANGS^^^ 

WLQYNGSWYYl^ANGDMATGWKDGDTWYYLEASGAMKASQWFKVSDK^YYTOGSGAIAVNT^ 
Z 

XD3 03 

ATGGTAAAAAGACGTATAAGGAGAGGGACGAGAGAACCTGAAAAAGTTGTTGTTCCTGAGCAATCATCTATTCCT 

GTATCCTGTATCTGTTACATCTAACCAAGGAACAGATGTAGCAGTAGAACCAGCTAAAGCAGT^^ 

ACTGGAAACAAGAAAATGGTATGTGGTATTTTTATAATACTGATGGTTCCATGGCAACAGGTTGGG 

AGTTCATGGTACTACCTCAACAGCAACGGTTCTATGAAAGTCAATCAATGGTTCCAAGTTGGTGGTA^ 

TGTAAATACATCGGGTGAGTTAGCGGTCAATACAAGTATAGATGGCTATAGAGTCAATGATAATGGTGAATC^ 
GTTAA 

MVKRRIRRGTREPEKVWPEQSSIPSYPVSVTSNQGTDVAVEPAKAVAPTTDWKQENGMWYFYl^ 
SSWYYLNSNGSMKVNQWFQVGGKWYYVNTSGELAVNTS IDGYRVNDNGEWVRZ 

XD3 04 

CTGAATACAAGTTTTGTTCATGCTGCTGATGGGATTCAATATGTCAGAGATGATACTAGAGATAAAG^^^ 
AGAGTATGATGACGCTGACAATGGGGATATTATTGTAAAAGTAGCGACTAAACCTAAGGTAGTAACCAAGAAA^ 

GGCTATGTAACTACGACAAGGACCTACGATGTTAA7CCAGAGACTGGTTATGTTACCGAACAGGTTACTGCT 

AATATGAAGCAGACAATGACCTTTCTGCAGGACAGGAGCAAGAGATTACTCTAG^ 

CAACTCTTGATAAAAGTCAAGAAGTAGAAGAAGTAGGAGAAATTGGTAAATTACTCTTACTAC^^ 

TITYWDGKSGQVTE^TLSOKKDSQTRWKKRTKPQVLVQEIPIETEYLJDGPTLDKSQEVEEVGEIG^^ 
ID3 0S 

AACATCATGGGoAAGAATATGATAGCCAAGCAGAGAAACGAGTCTATTATTTTGAAGATCAGCGTAGTTATCAT^ 
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TTAAAAACTGGTTGGATTTATGAAGAGGGTCATTGGTATTATTTACAGAAGGATGGTGGCTrrGATTCGCGCATCAA 
CAGATTGACGGTTGGAGAGCTAGCACGTGGTTGGGTTAAGGATTACCCTCTTACGTATGATGAAGAGAAGCTAAAAG 
CAGCTCCATGGTACTATCTAAATCCAGCAACTGGCATTATGCAAACAGGTTGGCAATATCTAGG^ 
TACCTCCATTCGTCAGGAGCTATGGCAACTGGCTGGTATAAGGAAGGCTCAACTTGGTACTATCTAGATGCTGAAAA 
5 TGGTGATATGAGAACTGGCTGGCAAAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTG 
GTTGGTATCAGGAAAGTTCGACTTGGTACTATCTAAATGCAAGTAATGGAGATATGAAAACAGGCTGGTTCCAAGTC 
AATGGTAACTGGTACTATGCCTATGATTCAGGTGCTTTAGCTGTTAATACCACAGTAGGTGGTTACTACTTAAACTA 
TAATGGTGAATGGGTTAAGTAA 

10 MKLLKK^WQIAIJVTFFFGLIATNTVFADDSEGWQFVQENGRTYYKKGDLKSTYWRVIDGBn^ 

IPAPHKGVTIGPSPRIEIALRPDWFYFGQDGVLQEFVGKQVLEAKTATNTNKHHGEEYDSQAEKRVYYFEIXJRSra^ 
LKTGWIYE£GHOTYL0KDGGFDSRINRLTVGELARGWVKI5YPLTYDE£KLKAAPWyYI^PATGI^^ 
YI^SSGAMATGWYKSGSTWYYLDAENGDMRTGWQNUJNK^YLRSSGAMATGWYQESSTWYYU^ASN^ 
NGNWYYAYDSGALAWTTVGGYYLNYNGEWVK2 

15 

XD3Q6 

TTGGCTGGTAGATATGGTTCTGCTGTTCAGTGTACAGAAGTGACTGCCTCAAACCTTTCAACAGTTAAAACTAAAGC 
TACGGTTGTAGAAAAACCACTGAAAGATrrrAGAGCGTCTACGTCTGATCAGTCTGGTTGGGTGGAATGX;^ 
AATGGTATTTCTATGAGTCTGGTGATGTGAAGACAGGTTGGGTGAAAACAGATGGTAAATGGTACTATTTGAATGAC 
20 TTAGGTGTCATGCAGACTGGATTTGTAAAATTTTCTGGTAGCTGGTATTACTTGAGCAATTCAGGTGCTATGTTTAC 
AGGCTGGGGAACAGATGGTAGCAGATGGTTCTACTTTGACGGCTCAGGAGCTATGAAGACAGGCTGGTACAAGGAAA 
ATGGCACTTGGTATTACCTTGACGAAGCAGGTATCATGAAGACAGGTTGGTTTAAAGTCGGACCACACTGGTACTAT 
GCCTACGGTTCAGGAGCTTTGGCTGTGAGCACAACAACACCAGATGGTTACCGTGTAAATGGTAATGGTGAATGGGT 
AAACTAG 

25 

IJVGRYGSAVQCTEVTASNLSTVrrKATVVEKPLKiDFRASTSDQSGWESNGKWYFYESGDVK^ 

LGVMQTGFVKFSGSWYYLSNSGAMFTGWGTDGSRWFYFDGSGAMKTGWYKENGTWYYLDEAGIMKTGWFKVGPHWYY 

AYGSGALAVSTTTPDGYRVNGNGEWVN2 

30 XD3 07 

ATGAAAATTTTGAAAAAAACTATGCAAGTTGGACTGACAGTATTTTTCTTTGGTTTGCTAGGGACCAGTACAGTATT 
TGCAGATGATTCTGAAGGATGGCAGTTTGTCCAAGAAAACGGAAGAACCTACTACAAAAAGGGGGACCTCAAAGAAA 
CCTACTGGCGAGTGATTGATGGTAAGTACTATTATTTTGATTCTCTATCTGGAGAGATGGTTGTCGGCTGGCAATAT 
ATCCCGTTTCCATCTAAAGGTAGTACAATTGGTCCTTACCCAAATGGTATCAGATTAGAAGGTTTTCCAAAGTCAGA 

35 GTGGTACTACTTCGATAAAAATGGAGTGCTACAAGAGTTTGTTGGTTGGAAAACATTAGAGATTAAAACTAAAGACA 
GTGTTGGAAGAAAGTACGGGGAAAAACGTGAAGATTCAGAAGATAAAGAAGAGAAGCGTTATTATACGAACTATTAC 
Tl^AATCAAAATCATTCTTTAGAGACAGGTTGGCTTTATGATCAGTCTAACTGGTATTATCTAGC^ 
TAATGGAGAAAACTACCTTGGTGGTGAAAGACGTGCGGGGTGGATAAACGATGATTCGACTTGGTACTACCTAGATC 
CAACAACTGGTAT7ATGCAAACAGGTTGGCAATATCTAGGTAATAAGTGGTACTACCTCCGTTCCTCAGGAGCAATG 

40 GCCACTGGCTGGTATCAGGAAGGTACCACTTGGTArrATTTAGACCACCCAAATGGCGATATGAAAACAGGTTGGCA 
AAACCTTGGGAACAAATGGTACTATCTCCGTTCATCAGGAGCTATGGCAACTGGTTGGTATCAAGATGGTTCAACTT 
GGTACTACCTAAATGCAGGTAATGGAGACATGAAGACAGGTTGGTTCCAGGTCAATGGCAACTGGTACTATGCTTAT 
AGCTCAGGTGCTTTGGCAGTGAATACGACCGTAGATGGCTATTCTGTCAACTATAATGGCGAATGGGTTCGGTAA 

45 MKILKKTMQVGL7VFFFGLLGTSTVFADDSEGWQFVQENGRTYYK2CGDLKETYWRVIDGKYYYFDSLSGEMVVGWQY 
IPFPSKGSTIGPYPNGIRLEGFPKSEWYYFDKNGVLQEFVGWKTLEIKTKDSVGRKYGEKREDSSDKEEKRYYTNYY 
FNQNHSLETGWLYDQSOTYYUUCrSINGENYLGGERRAGWrtroDSTWYYLDPTTGIMQTGWQYLGNKWYYLRSSG^ 
ATGWYQEGTTWYYI^HPNGDMKTGWQNLGNKWYYLRSSGAMATGWYQDGSTWYYLNAGNGDMKTGWFQVNGW 
SSGALAVNTTVDGYSVNYNGSWVR2 

50 

ID3 0 8 

ATGAAAAAGAAATTAACTAGrrTAGCACTTGTAGGCGCTTTTTTAGGTTTGTCATGGTATGGGAATGTTCAGGCTCA 
AGAAAGTTCAGGAAATAAAATCCACTTTATCAATGTTCAAGAAGGTGGCAGTGATGCGATTATTCTTGA^ 

55 GACATTTTGCCATGGTGGATACAGGAGAAGATTATGATTTCCCAGATGGAAGTGATTCTCGCTATCCATGGAGAGAA 
GGAATTGAAACGTCTTATAAGCATGTTCTAACAGACCGTGTCTTTCGTCGTTTGAAGGAATTGGGTGTCCAAAAACT 
TGATTTTATTTTGGTGACCCATACCCACAGTGATCATATTGGAAATGTTGATGAATTACTGTCTACCTATCCAGTTG 
ACCGAGTCTATCTTAAGAAATATAGTGATAGTCGTATTACTAATTCTGAACGTCTATGGGATAATCTGTATGGCTAT 
GATAAGGTTTTACAGACTGCTGCAGAAAAAGGTGTTTCAGTTATTCAAAATATCACACAAGGGGATGCTCATTTTCA 

60 GTTTGGGGACATGGATATTCAGCTCTATAATTATGAAAATGAAACTGATTCATCGGGTGAATTAAAGAAAATTTGGG 
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ATGACAATTCCAATTCCTTGATTAGCXSTGGTGAAAGTCAATGGCAAGAAAATTTACCT^ 

GTTCATGGAGCAGAAGACAAGTATGGTCCTCTCATTGGAAAAGTTGATTTGATGAAGTTTAA^ 

CAACAAATCAAATACCAAGGATTTCATTAAAAATTTGAGTCCGAGTTTGATTGTTCA^ 

GGAAAAATGGTGTTGATAGTGAGTATGTTAATTGGCTCAAAGAACGAGGAATTGAGAGAATCAACGCAGCCAGC^^ 
5 GACTATGATGCAACAGTTTTTGATATTCGAAAAGACGGTTTTGTCAATATTT 

TTTTCAAGCTGGTTGGCATAAGAGTGCATATGGGAACTGGTGGTATCAAGCGCCTGATTCTACA 
TCGGTTGGAATGAAATCGAAGGTGAATGGTATTACTTTAACCAAACGGGTATCTTGTTACAGAATC^ 
TGGAACAATCATTGGTTCTATTTGACAGACTCrGGTGCTTCTGCTAAAAATTGGAAGAAAATCG^ 
TTATTTTAACAAAGAAAACCAGATGGAAATTGGTTGGATTCAAGATAAAGAGCAGTGGTA 

1 0 GTTCTATGAAGACAGGATGGCrrCAATATATGGGGCAATGGTATTACTTTGCTCCATCA<^^ 

TGGGTAAAAGATAAAGAAACCTGGTACTATATGGATTCTACTGGTGTCATGAAGACAGGTGAGATAGAAGTTGCTGG 
TCAACArrATTATCTGGAAGATTCAGGAGCTATGAAGCAAGGCTGGCATAAAAAGGCAAATGAT^ 
AGACAGACGGTTCACGAGCTGTGGGTTGGATCAAGGACAAGGATAAATGGTACTTCTTGAAAGAAAATGG^ 
CTTGTGAACGGTAAGACACCAGAAGGTTATACTGTGGATTCAAGTGGTGCCTGGTTAGTGGATGTTTCGA^ 

15 ATCTGCTACAATTAAAACTACAAGTCATTCAGAAATAAAAGAATCCAAAGAAGTAGTGAAAAAGGATC^ 
AAGAAACGAGTCAACATGAAAGTGTTACAAATTTTTCAACTAGTCAAGATTTGACATC 
GAAACGAGTG7AAACAAATCGGAATCAGAACAGTAG 

MKKKLTSIJU.VGAFLGLSWyGhrVQAOESSGWKIHFIWQEGGSDAIILESNGHFA>fVDTGEDYDFPDGSDSRyPWRS 
20 GIETSYKHV^^TDRVFRRLKELGVOIaDFILVTHTHSDHIG^AmELLSTYPVDRVYLKKYSDSRITNSER^ 

DKVLQTAAEKGVSVIONITQGDAHFQFGDMDIQLYNYENETDSSGELKKIWDDNSNSLISWKWGKKIYLGGDLDN 
VHGAEDKYGPLIGKTOI^KFNHHHDTOfCSNTKDFIK^^I^PSLIVOTSDSLPWK^ 

DYDATVFDIRKDGFVNISTSYKPIPSFQAGWHKSAYGNWWYQAPDSTGEYAVGWNEIEGEWYYFNQTGILLQNQWKK 
WNimW FYLTDSGAS AKl^KXI AGI W YY FNKENQME I GW I QDKEQW Y YLDVDGSMKTGWLQ YMC^ 
25 WKDKETWYYMDSTGVMKTGEIEVAGQHYYLEDSGAMKQGWHKKA^roWYFyKTD^ 

LVNGKTPEGYTVDSSGAWLVDVS I EKSATI KTTSHSE I KESKEWKKDLENKETSQKESVTNFSTSQDLTSST^ 
ETSVNKSESEQZ 

XD309 

30 ATGGAAATTAATGTGAGTAAATTAAGAACAGATTTGCCTCAAGTCGGCGTGCAACCATATAGGCAAGTACACGCAC^ 
CTCAACrGGGAATCCGCATTCAACCGTACAGAATGAAGCGGATTATCACrGGCGGAAAGACCCAG^ 
TCTCGCACATTGTTGGGAACGGTTGCATCATGCAGGTAGGACCTGTTGATAATGGTGCCTGGGACGTTGGGGGCGGT 
TGGAATGCTGAGACCTATGCAGCGGTTGAACTGATTGAAAGCCATTCAACCAAAGAAGAGTTCATGACGGAC^ 
CCTTTATATCGAACTCTTACGCAATCTAGCAGATGAAGCAGGTTTGCCGAAAACGCTTGATACAGGGAGTTTAGCTG 

35 GAATTAAAACGCACGAGTATTGCACGAATAACCAACCAAACAACCACTCAGACCACGTTGACCCTTATCCATATCTT 

gctaaatggggcattagccgtgagcagtttaagcatgatattgagaacggcttgacgat7gaaacaggctggcagaa 
gaatgacactggctactggtacgtacattcagacggctcttatccaaaagacaagtttgagaaaatcaatggcact 
ggtactactttgacagttcaggctatatgcttgcagaccgctggaggaagcacacagacggcaactggtactggttc 
gacaactcaggcgaaatggctacaggctggaagaaaatcgctgataagtggtactatttcaacgaag;^ 
40 gaagacaggctgggtcaagtacaaggacacttggtactacttagacgctaaagaaggcgccatggtatcaaatgcct 
ttatccagtcagcggacggaacaggctggtactacctcaaaccagacggaacactggcagacaagccagaattcaca 
gtagagccagatggcttgattacagtaaaataa 

meinvsklrtdlpqvgvqpyrovhahstgnphstvqneadyhwrkdpelgffshivgngcimqvgpvdngawdvggg 
45 wnaetyaavelieshstkeefmtdyrlyiellrni^eaglpktldtgsiagiktheyctnnopnnhsdhvdp 
akwgisreqfkhdiengltietgwqkndtgywyvhsdgsypkdkfekingtwyyfdssgymladrwrra 
dnsgekatgwkkiadkwyyfneegamktgwkyiodtwyyldakegamvsnafiqsadgtgwyylkpdgti^ 
vepdglitvkz 

50 ID310 

ATGGGCACAACAGGATrrACAATAATTGACTTAATTATCTTGATTGTTTATTTACTTGCGGTGTTGGTTGa 
CTATTTCTCrrAAAAAAGAGATGAAAGGAAAAGAGTTCTTTAAAGGAGATGGTTCGGTTCCTTGGTATGTTACT 
TATCCATTTTTGCCACAATGCTCAGTCCGATTTCCTTCTTGGGACTCGCTGGTAGCTCTTATGCAGGTAGCTGGATT 
TTATGGTTTGCTCAATTAGGGATGGTAGTAGCTATTCCACTGACAATTCGTTTTATCTTACCTATCTTTGC^^ 
^ 5 AGACATCGATACGGCATATGATTACTTGGATAAACGTTTTAATTCTAAAGCACTTCGTATTATTTCAGCACT 
TTATTATTTATCAATTGGGACGTATGTCTATCATTATGTACCTCCCATCAGCTGGTTTATCAGTATTGA 
GACATCAATATTTTGATTATTTTGATGGGTGTAGTTGCAATTGTTTATTCTTATACTGGTGGTCTAAAATCC 
ATGGACAGACTTTATTCAAGGTGTGATTCTGATTAGTGGTGTCGTTTTAGCTTTA 

AAGGTGGCTTTGGTGCAGTAGCAGAAACATTAGCAAACGGGAAATTCCTTGCTGCAAATGAAAAACTTTTCGA 
60 AACTTGCTTTCAAACTCCATCTTTTTAArTGTGATGGGTTCAGGCTTTAC^ 
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AGATTTGGTTCAACGTTTTACTACAACACAAAATATTAAGAAACTTAATAAGATGT^^ 
CACTTGCAACTGCAACAGTCrriTTACTTGATTGGTACAGGC^ 

GCAGCTAGCAATATCCCTCAAGACCAAATCTTTATGTACTTTATTGCATACCAGTTACC^^ 
GATCTTGGCAGCGATTTATGCAGCATCTCAATCAACTATTTCAACAGGTTTGAACTCTGTTGC;^ 
5 TGGATATTCAAGATGTCATTTCTAAAAATATGTCAGACAATCGTCGTACGAAAATTGCACAATTCGTATCT 
GTAGGTTTATTCTCAATTGGTGTTTCCATTGTCATGGCTCACTCAGATATTAAATCT^ 
TTTCATGGGACriTGTACTTGGTCTACTTGGTGGTGTATTTATTCrrTGGATTTGTTTCTA;^^ 
GTGCTTATGCAGCGCTGAITGTATCAACCATCGTCATGGTATTTATTAAATACTTCCTTCCTC^ 
TACTGGGCATATTCATTGATTTCAATCTCTGTATCAGTAGTTTCAGGTTATATTGTATCTGTTC^ 
1 0 AGTATCTGCACCTAAATATACAACGATTCATGATATTACAGAAATTAAAGCGGATTCAAGTTGGGAAGTT^ 
AA 

MGTTGFTIIDLIILIVYLIJ^VLVAGIYFSKKEMKGKEFFKGDGSVPWYVTSVSIFATMI^PISFLGLAGSSYAGSW 
LWFAQLGMVVAIPLTIRFILPIFARIDIDTAYDYLDKRFNSKALRIISALLFIIYQLGRMSIIMyLPSAGLSVLTGI 
15 DINILIILMGWAIVYSYTGGLKSVLWTDFIQGVILISGWUVLFVLIANIKGGFGAVAETl^GKFIJ^ 
NLLSNSIFLIVMGSGFTILSSYASSQDLVQRFTTTQNIKKI^K>a.FTNGVLSLATATVFYLIGTGLW 
AASNIPQDQXFMYFIAYQL-PVGITGLIIJUVlYAASQSTISTGt^SVATSWTlX)IQDVISKNMSDNRRTKIAQFVSI^ 
VGIJSIGVSIVMAHSDIKSAYEWFNSFMGLVLGLLGGVFIiGFVSKKANJCQGAYAALIVSTrVMVF 
YWAYSLISISVSWSGYIVSVLTGNKVSAPKYTTIHDITEIKADSSWEVRH2 

20 

ID311 

ATGAAAATTAATAAAAAATA7CTAGCAGGTTCAGTGGCAGTCCTTGCCCTAAGTGTTTGTTCCTATGAGCT 

TCACCL^AGCTGGTCAGGATAAGAAAGAGTCTAATCGAGTTGCTTATATAGATGGTGATCAGGCTCGTCAAAAGGCA 

AAAACTTGACACCAGATGAAGTCAGTAAGAGGGAGGGGATCAACGCCGAACAAATCGTCATCAAGArrACGGATC^ 

25 GGTTATGTGACCTCTCATGGAGACCATTATCATTACTATAATGGCAAGGTCCCTTATGATGCCATCATCAGTGAAGA 
GCTCCrrCATGAAAGATCCGAATTATCAGrrGAAGGATTCAGACATTGTCAATGAAATCAAGGGTGGTTATGTC^ 
AGGTAGACGGAAAATACTATGTTTACCTTAAGGATGCAGCTCATGCGGATAATATTCGGACAAAAGAAGAGATTAAA 
CGTCAGAAGCAGGAACGCAGTCATAATCACGGGTCAGGAGCTAACGATCATGCAGTAGCTGCAGCCAGAGCCCAAGG 
ACGCTATACAACGGATGATGGGTATATCTTCAATGCATCTGATATCATTGAGGACACGGGTGATGCTTATATCGTTC 

30 CTCACGGCGACCATTACCATTACATTCCTAAGAATGAGTTATCAGCTAGCGAGTTAGCTGCTGCAGAAGCCTATTGG 
AATGGGAAGCAGGGATCTCGTCCTTCTTCAAGTTCTAGTTATAATGCAAATCCAGCTCAACCAAGATTGT^ 
CCACAATCTGACTGTCACTCCAACTTATCATCAAAATCAAGGGGAAAACATTTCAAGCCTT^ 
CTAAACCCTTATCAGAACGCCATGTGGAATCTGATGGCCTTATTTTCGACCCAGCGCyUiATCACAAGTCGAA 
AGAGGTGTAGCTGTCCCTCATGGTAACCATTACCA C 7TTATCCCTTATGAACAAATGTCTGAATTGGAAAAACGAAT 

35 TGCTCGTATTATTCCCCTTCGTTATCGTTCAAACCATTGGGTACCAGATTCAAGACCAGAACAACCAAGTCCACAAT 
CGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAATCCTCAACCAGCTCCA^ 

TTGGTCAAAGAAGCTGTTCGAAAAGTAGGCGATGGTTATGTCTTTGAGGAGAATGGAGTTTCTCGTTATATCCCAGC 

CAAGGATCTTTCAGCAGAAACAGCAGCAGGCATTGATAGCAAACTGGCCAAGCAGGAAAGTTTATCT^ 

GAGCTAAGAAAACTGACCTCCCATCTAGTGATCGAGAATTTTACAATAAGGCTTATGACTTACTAGCAAGAATTCAC 

40 CAAGATTTACTTGATAATAAAGGTCGACAAGTTGATTTTGAGGCTTTGGATAACCTGTTGGAACGACrCAAGGATGT 
CCCAAGTGATAAAGTCAAGTTAGTGGATGATATTCTTGCCTTCTTAGCTCCGATTCGTCATCCAGAACGTTTAGGAA 
AACCAAATGCGCAAATTACCTACACTGATGATGAGATTCAAGTAGCCAAGTTGGCAGGCAAGTACACAACAGA^ 
GGTTATATCTTTGATCCTCGTGATATAACCAGTGATGAGGGGGATGCCTATGTAACTCCACATATGACCCATAGCCA 
CTGGATTAAAAAAGATAGTTTGTCTGAAGCTGAGAGAGCGGCAGCCCAGGCTTATGCTAAAGAGAAAGGTTTGACCC 

45 CTCCTTCGACAGACCATCAGGATTCAGGAAATACTGAGGCAAAAGGAGCAGAAGCTATCTACJ^CCGCGTGAAAG^ 
GCTAAGAAGGTGCCACTTGATCGTATGCCTTACAATCTTCAATATACTGTAGAAGTCAAAAACGGTAGTTTAATC^ 
ACCTCATTATGACCATTACCATAACATCAAATTTGAGTGGTTTGACGAAGGCCTTTATGAGGCACCTAAGGGGTAT^ 
CTCTTGAGGATCTTTTGGCGACTGTCAAGTACTATGTCGAACATCCAAACGAACGTCCGCATTCAGATAA 
GGTAACGCTAGCGACCATG7TCAAAGAAACAAAAATGGTCAAGCTGATACCAATCAAACGGAAAAACCAAGCGAGGA 

50 GAAACCTCAGACAGAAAAACCTGAGGAAGAAACCCCTCGAGAAGAGAAACCGCAAAGCGAGAAACCAGAGTCTCCAA 
AACCAACAGAGGAACCAGAAGAATCACCAGAGGAATCAGAAGijVCCTCAGGTCGAGACTGAAAAGGTTGAAGA;^^ 
CTGAGAGAGGCTGAAGATTTACriTGGAAAAATCCAGGATCCAATTATCAAGTCCAATGCCAAAGAGACTCTCACAGG 
AITAAAAAATAATTTACTATTTGGCACCCAGGACAACAATACTATTATGGCAGAAGCTGAAAAACT 
TAAAGGAGAGTAAGTAA 

55 

MKINKKYIJ^GSVAVTALSVCSYELGRHOAGODKXESNRVAYIDGDQAGQKAENLTPDEVSKREGINAEQIVIKITDQ 
G YVTSHGDHYHY YNGKVP YDAI I SEELLMKDPNYQLKDSD I VNEI KGG YV I KVDGKYYVYLKDAAHADNI 5tTKEEI K 
RQKOERSHNHGSGANDHAVAA?^OGRYTTODGYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASEIiAA 
NGKQGSRPSSSSSYNANPAOPRLSENHNLTVTPTYHONQGENISSLLRELYAKPLSERHVESDGLIFDPAQITSRTA 
60 RGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSNHWVPDSRPEQPSPOSTPEPSPSPQPAPNPQPAPSNPIDEK 
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LVKEAVRKVGDGYVFESNGVSRYIPAKDLSAETAAGIDSKIAKOESLSHKIXSAKK^ 
QDIiUDNKSRQVDFEAIJDMLLERLKDVPSDKVKX/VDDIIJ^lAPIRHPERLGK^ 
GVI FDPRDITSDEGDAyVTPH>riHSHWI KKDSI^EAERAAAQAYAKEKGLTPPSTDHODSGOTE^ 
AKKVPlJ^iy^PYNI^rrVEVKNGSLIIPHYDHYHNIKFEWFDSGLYEAPKGyTI^ 
5 GNASDHVQRNKNGQADTNQTEKPSEEKPQTEKPEEETPREEKPQSEKPESPKPTEEPEESPEESEEPQVETEKVEEK 
LREAEDLLGKI QDP 1 1 KSNAKETLTGLKNNLLFGTQDNNTIMAEAEiaLALLKESKZ 



ATGGAGGGATTGGTTAGAGTGCATTTATTGCCTGTATTTGGCGATTACAAGCTATCTAAACTTACTACGCCT 

10 tcaacagcaagtaaacaaatgggctgacaaggcaaataaaggcgaaaaaggggcatct 

ataacatgaataagcgtattitgaaatatggcgtagctatccaggtaatacaatacaacccagctaatc^ 
gttccacgcaaacagcaaaaagaaaaggctgctgtcaaatacttagacaacaaagaa 
tttagatgctctggatcaatcaaattatgagaacttatttgatgttgttctgtat 
gccgtattagtgaggctctggctcttgaatggtctgatattgacctagaaagcggtgttatcagcatcaat;^ 

15 ctaaaccgctatcaggaaataaactcacctaaatcaagcgctggttatcgtgatataccaatagacaaagccaa 
acttttactgaaacaatacaaaaaccgtcaacaaattcagtcttggaaattaggccgatctga;^ 
ctgtatttacggagaaatatgcttatgcttgtaacttacgcaaacgcctaaataagcatt^ 
actaacgtatcatttcatggtttccgccy^tacacatactactatga1?gctctatgctcaggr^ 
tcagtatagattaggccactctaattta?vtgatcactgaaaatacttactggcatactaaccaagaga^ 

20 aagccgtctcaaattatgaaacagctatcaacaatttataa 

meglvrvhllpvfgd ykls klttp i lqoqvnkwadkankge kgaf anys llhnmnkri lkygvai qvi q ynpandv i 
vprkqokekaavkyudnkelkqfij^yldax^sotenlfdwlyktliatgcrisea^^ 
lmyqeinspkssagyrdipidkatllllkoyknrqqiqswkl.grs£twfsvftekyayacnlrkri^ 
25 tkvsfhgfrhthttmmlyaqvspkdvoyrlghsnxj^itentywhtnqenakkavsnyetainnlz 



